RIBOZYME WITH tRNA SYNTHETASE ACTIVITY AND METHODS OF MANUFACTURING AND USING THE SAME

ABSTRACT

Ribozymes exhibiting tRNA synthetase activity and substrate specificity, as well as methods for engineering and producing the same, are disclosed. The ribozymes of the present disclosure comprise a T-box module fused with a flexizyme module. The flexizyme module provides high promiscuity with respect to amino acid substrates and the T-box module provides tRNA substrate specificity, which may be engineered as desired. Systems are also described for aminoacylation of suppressor tRNAs with unnatural amino acids (uAAs), such systems comprising the ribozyme previously mentioned, suppressor tRNA, and the desired uAA(s). Methods for incorporating a uAA into a growing polypeptide chain using the ribozyme hereof are also provided.

PRIORITY

The present application is related to, claims the priority benefit of, and is a continuation-in-part of U.S. patent application Ser. No. 15/209,422 to Golden et al., filed Jul. 13, 2016, and issued as U.S. Pat. No. 10,113,168 on Oct. 30, 2018, which is related to and claims the priority benefit of U.S. Provisional Patent Application Ser. No. 62/166,680 to Golden et al., filed May 27, 2015. The entire contents of each of the aforementioned priority applications are hereby expressly incorporated by reference in their entireties into this disclosure.

BACKGROUND

Proteins perform a vast array of functions within living organisms and are the chief actors within a cell. Examples of such functionality include catalyzing metabolic/chemical reactions (i.e. enzymes), cell signaling and signal transduction, DNA replication, providing structural support (i.e. structural proteins), and transporting molecules from one location to another. Proteins differ from one another primarily with respect to their sequence of amino acids, which is dictated by the nucleotide sequence of their genes and typically results in protein folding into a specific three-dimensional (3D) structure that determines its activity.

In basic ribosomal protein biosynthesis, messenger RNA (mRNA) encodes a protein through the process of translation. In brief, ribosome molecules read down the length of mRNA codons (each a sequence of three-nucleotides) and translate the genetic information contained therein to a specific sequence of amino acids by facilitating complementary base pairing to the complementary transfer RNA (tRNA) anticodons.

Each mRNA codon is recognized by a particular tRNA. During translation, each of the tRNA molecules that bind the mRNA is “charged,” meaning that it is carrying a specific amino acid via a covalent bond. As such, when a particular tRNA binds with its complementary codon on the mRNA at the ribosome, its cargo amino acid is lined up with the amino acid of the tRNA corresponding to the next codon on the mRNA. Thereafter, a peptide bond forms between the amino acids and the tRNA releases its amino acid, thus forming a chain of amino acids—or a polypeptide—as the mRNA pass through and are read by the ribosome. Termination of the polypeptide happens when the ribosome hits a stop codon in the mRNA that ends the translation process. The polypeptide is released and folds into its dictated 3D geometry. Accordingly, the specific nucleotide sequence of an mRNA specifies which amino acids are incorporated into the protein product, with the role of tRNA being to specify which amino acids correspond with the sequence from the mRNA. The particular sequence of amino acid in a protein product has a direct effect as to the resulting structure and, thus, functionality of the protein.

Conventionally, each type of tRNA molecule can be attached to only one type of amino acid. The covalent attachment between the tRNAs and their specific amino acids is catalyzed by aminoacyl tRNA synthetases. Typically, aminoacyl tRNA synthetases are extremely specific with respect to tRNA and the related amino acid. Accordingly, a specific tRNA substrate will only take on a particular amino acid when aminoacylated with the correct aminoacyl tRNA synthetase

Through protein engineering techniques, the natural translation process can be manipulated to study protein structure and function, as well as for protein modification. Indeed, protein engineering has become an extensively used tool in molecular biology, with methods for incorporating even unnatural amino acids (uAAs) into proteins to develop unique functionalities and/or improved protein function. For example, the site-specific introduction of uAAs can be used to probe enzyme mechanisms, increase acidity, localize proteins within cells (through adding a fluorescent label or otherwise), improve the therapeutic properties of drugs, and the like. Because of the potential in this area, there is significant interest in expanding the chemical diversity of proteins beyond the twenty (20) amino acids most commonly incorporated during ribosomal protein synthesis.

The most common conventional method of introducing an uAA during protein biosynthesis employs a functional pair of tRNA and aminoacyl tRNA synthetase (“ARS”)—an orthogonal set—to act independently of the endogenous aminoacylation machinery of the cell. Specifically, the ARS is engineered to charge a tRNA (for example, an amber suppressor tRNA) with the particular uAA of interest while the tRNA recognizes a specialized codon within the mRNA (typically an amber STOP codon (UAG, for example) or a four-base codon) that does not code for one of the natural amino acids. Accordingly, the charged tRNA delivers the uAA to the ribosome for protein synthesis and, in doing so, uniquely introduces the uAA into a protein at the desired site.

To be introduced selectively at its predetermined position only, the orthogonal set must not crosstalk with the endogenous tRNA and synthetase sets, while remaining functionally compatible with the ribosome and other components of the translation apparatus. This is problematic for several reasons, one of which being that many tRNA synthetases recognize the anticodon loop of the tRNA and thus cannot be used to charge a tRNA that recognizes a stop codon. Furthermore, the active site of the ARS must be capable of accommodating the uAA of interest. As proteins normally have exquisite specificity for their substrates, this can significantly limit the identity of the uAA accepted.

Orthogonal tRNA synthetases are conventionally generated using archaeal proteins that, when introduced into prokaryotes such as E. coli, can discriminate between all of the tRNA substrates available in a cell or in an in vitro translation system and bind only to its orthogonal tRNA partner. In this manner, only the appropriate tRNA substrate is charged with the uAA. However, Eukarya is the most complex domain of life, with not only more tRNA gene content, but also higher variation. While conventional orthogonal methodologies have had some success in prokaryotic and in vitro translation systems, a robust mechanism to introduce uAAs into eukaryotes has yet to be established. Indeed, the foreign ARS and tRNA conventional pairs are typically not successful when used in vivo in eukaryotic cells, as there is a substantial risk the orthogonal set will recognize—and charge—their homologs derived from the host organism. Furthermore, as previously stated, significant engineering is required to redesign the orthogonal ARS such that it will accept the uAA, and each desired modification within a protein potentially requires additional engineering of the ARS. The complexity of ARS engineering limits the chemical diversity of which uAAs can be incorporated.

Accordingly, what is needed is an efficient and effective system capable of selecting and selectively charging tRNA with a wide variety of uAAs within a eukaryotic cell.

BRIEF SUMMARY

The present disclosure provides artificial ribozymes comprising a T-box element and a flexizyme (an amino-acylating ribozyme) and methods for producing the same. Also disclosed are methods for incorporating an unnatural amino acid into a protein using the novel artificial ribozymes described herein and synthesis kits for producing peptides that incorporate one or more amino acids at one or more specific sites that utilize the artificial ribozymes hereof.

In at least one exemplary embodiment of the present disclosure, an artificial ribozyme is provided, the ribozyme comprising a T-box element and a flexizyme. The flexizyme comprises an active site for binding an unnatural amino acid and charging a tRNA molecule through a CCA-tail at its 3′-terminus, and the T-box element recognizes and preferentially binds the body of a tRNA substrate. Furthermore, the flexizyme is linked with the T-box element such that the flexizyme can bind and aminoacylate a tRNA substrate bound by the T-box element. The T-box element of the ribozyme may be linked to the flexizyme through a P1 stem of the flexizyme. Additionally or alternatively, the active site of the flexizyme is not specific to a targeted unnatural amino acid.

In at least one exemplary embodiment, the T-box element may be a bacterial T-box element derived from a Geobacillus kaustophilus and the flexizyme may comprise a dinitro-flexizyme. Furthermore, the flexizyme may comprise a circular permutation formed by a link between an original 3′-terminus end of the flexizyme and an original 5′-terminus end of the flexizyme, wherein a P1 loop of the flexizyme is open to create a new 5′-terminus end and a new 3′-terminus end and comprises SEQ ID NO: 2. In some embodiments of the aforementioned circular permutation, the original 5′-terminus end and the original 3′-terminus end of the flexizyme may be joined by a linker comprised of seven, eight, or ten nucleotides. Alternatively, the flexizyme may comprise a linear permutation formed by a link between a 3′-terminus end of the T-box element and a 5′-terminus end of the flexizyme, there the link comprising a poly-A linker comprising between five and ten nucleotides. Still further, the ribozyme may comprise SEQ ID No: 1 and/or a 3′-terminus of the T-box element may be linked to a 5′-terminus of a circularly permuted flexizyme such as that shown in SEQ ID NO: 2.

Still further, in at least one additional embodiment, an artificial ribozyme comprising a T-box element comprising a 3′-terminus and a specifier loop for binding a cognate tRNA substrate with specificity and a flexizyme comprising dinitro-flexizyme, Fx3, or a variant of Fx3 is also provided. The flexizyme of such artificial ribozyme may define at least an active site for binding an amino acid, an acceptor end for base pairing a terminus of a tRNA substrate, a nucleotide linker, and a P1 stem linked to the 3′-terminus of the T-box element. There, the T-box element recognizes and preferentially binds the cognate tRNA substrate, the flexizyme is capable of binding and aminoacylating the cognate tRNA substrate bound by the T-box element, and the active site of the flexizyme is not specific to one or more targeted amino acids. The one or more targeted amino acids may comprise a natural amino acid, an unnatural amino acid, or both.

The T-box element of the artificial ribozymes disclosed herein may further comprise a mutation that is complimentary to an anticodon of the cognate tRNA substrate. For example, in at least one embodiment, the mutation may be at a location in the T-box element that corresponds with an identity element for determining a degree of orthogonality of the cognate tRNA substrate. In this manner, the T-box may be specifically designed to be specific to a particular tRNA substrate.

In at least one embodiment, the mutation may be at the specifier loop of the T-box element and may, in certain embodiments, comprise a stop codon comprising an uracil-adenine-guanine nucleotide sequence. Additionally or alternatively, the cognate tRNA substrate may comprise an amber suppressor tRNA. Furthermore, the flexizyme of the artificial ribozyme may comprise SEQ ID NO: 2, SEQ ID NO: 3, or a functional equivalent of either SEQ ID NO: 2 or SEQ ID NO: 3.

Methods for producing an artificial ribozyme are also provided. In at least one embodiment, the method comprises the steps of: attaching a 3′-terminus end of a T-box element with a 5′-terminus end of a circularly-permuted flexizyme such that the flexizyme can aminoacylate a tRNA substrate that is bound by the T-box element; binding, with specificity, an anticodon of a preferred tRNA substrate with a specifier loop of the T-box element; binding a 3′-terminus end of the preferred tRNA substrate with an acceptor end of the flexizyme; and aminoacylating the preferred tRNA substrate with an unnatural amino acid bound to an active site of the flexizyme. Additional embodiments of the methods described herein may further comprise the step of optimizing a rate at which the step of aminoacylating the preferred tRNA substrate is performed by modifying a discriminant base of the 3′-terminus end of the preferred tRNA substrate to complement a base of the acceptor end of the flexizyme. Additionally or alternatively, the methods may further comprise the step of modifying a binding specificity of the T-box element by adding or removing base pairs from a P1 stem of the flexizyme. Still further, certain embodiments comprise the step of modifying aminoacylation efficiency of the flexizyme by adding or removing base pairs from a P1 stem of the flexizyme.

Methods for incorporating an unnatural amino acid into a protein are also disclosed. In at least one embodiment, the method comprises the steps of: providing an artificial ribozyme, a preferred tRNA substrate, and an unnatural amino acid substrate to a translation system, the translation system comprising translation components derived from a bacterial, archeaeal, or eukaryotic organism and the ribozyme comprising: a binding element with specificity for binding a preferred tRNA substrate, and a catalytic element linked to the binding element, the catalytic element for catalyzing aminoacylation of a tRNA substrate bound by both the binding element and the catalytic element, and the catalytic element comprising an active site for binding an unnatural amino acid; binding, with specificity, an anticodon of the preferred tRNA substrate with the binding element of the ribozyme; binding a 3′-terminus end of the preferred tRNA substrate with the catalytic element of the ribozyme; reacting the catalytic element with the preferred tRNA substrate under conditions such that an unnatural amino acid substrate bound to the active site of the catalytic element is released and a bond is formed between the preferred tRNA substrate and the amino acid substrate to produce a charged tRNA; and producing a protein, wherein the unnatural amino acid is incorporated into the protein.

In at least one embodiment of the above-described method, the step of providing further comprises delivering the artificial ribozyme, the preferred tRNA substrate, and an amino acid substrate to a cell on a single plasmid. Additional embodiments comprise the step of releasing the charged tRNA from the artificial ribozyme. In certain embodiments, the T-box element of the ribozyme is orthogonal with respect to the translation components derived from the bacterial, archeael, or eukaryotic organism and/or the preferred tRNA substrate may be a suppressor tRNA. Furthermore, the cell may be from an organism including, without limitation, a cell selected from the group consisting of a bacterial cell, a yeast cell, an insect cell, and a mammalian cell, and the method is performed in vivo.

Additionally or alternatively, the preferred tRNA substrate may be provided in a pool of variant tRNA substrates and the method step of binding, with specificity, an anticodon of the preferred tRNA substrate may further comprise selecting the preferred tRNA substrate from a pool comprising the preferred tRNA substrate and endogenous tRNA substrates. There, the pool may also further comprise one or more aminoacyl synthetase/tRNA pairs, where the tRNA of such pair(s) are not the preferred tRNA substrate.

Synthesis kits for producing peptides that incorporate one or more amino acids at one or more specific sites are also provided in the present disclosure. In at least one exemplary embodiment, such a kit comprises a first cognate tRNA substrate, a first artificial ribozyme comprising one or more of the artificial ribozymes described herein, a first targeted amino acid, and one or more reagents for conducting in vitro translation. In at least one embodiment of the kit, the first artificial ribozyme, the first cognate tRNA substrate, and the first targeted amino acid comprise a first group and at least one of the members of that first group is housed in a first container separate from the other members of the group. Still further, the first targeted amino acid may comprise an unnatural amino acid or a natural amino acid.

The T-box element of the first artificial ribozyme of the kit recognizes and preferentially binds the first cognate tRNA substrate. For example, and without limitation, the T-box element of the first artificial ribozyme may comprise a mutation from wild-type, the mutation complimentary to an anticodon of the first cognate tRNA substrate. In at least one exemplary embodiment of the present disclosure, the mutation of the T-box element is at the specifier loop, the first cognate tRNA substrate comprises an amber suppressor tRNA, and the mutation of the T-box element is complimentary to an anticodon of the amber suppressor tRNA.

In at least one embodiment, the flexizyme of the first artificial ribozyme of the kit comprises SEQ ID NO: 2, SEQ ID NO: 3, or a functional equivalent of either SEQ ID NO: 2 or SEQ ID NO: 3.

The synthesis kit of the present disclosure may further comprise a second cognate tRNA substrate and a second artificial ribozyme (that is either the same as the first artificial ribozyme within the kit or comprising a variant of the artificial ribozyme disclosed herein). Here, the T-box element of the second artificial ribozyme again recognizes and preferentially binds the second cognate tRNA substrate. Such kit may also comprise a second targeted amino acid. The second artificial ribozyme, the second cognate tRNA substrate, and the second targeted amino acid may comprise a second group, where at least one of the members of the second group is housed in a second container that is separate from the other members of the second group.

Methods for synthesizing peptides that incorporate one or more amino acids at a specific site are also provided. In at least one exemplary embodiment, such a method comprises the steps of: providing a tRNA substrate, a first amino acid, and an artificial ribozyme; aminoacylating the tRNA substrate with the first amino acid to produce a first aminoacylated tRNA substrate by combining the artificial ribozyme with the first amino acid such that the active site of the flexizyme binds with the first amino acid, combining the artificial ribozyme with the tRNA substrate such that the specifier loop of the T-box element binds, with specificity, an anticodon on the tRNA substrate and a 3′-terminus of the tRNA substrate binds the acceptor end of the flexizyme, and reacting the bound tRNA substrate with the bound first amino acid; and delivering the first aminoacylated tRNA substrate to a ribosome.

In such methods, the artificial ribozyme may comprise a T-box element comprising a 3′-terminus and a specifier loop for binding the tRNA substrate with specificity, and a flexizyme comprising dinitro-flexizyme, Fx3, or a variant of Fx3 and defines at least an active site for binding an amino acid, an acceptor end for base pairing to a terminus of the tRNA substrate, a nucleotide linker, and a P1 stem linked to the 3′-terminus of the T-box element. When the T-box element recognizes and preferentially binds the tRNA substrate, the flexizyme is capable of binding and aminoacylating the tRNA substrate bound by the T-box element, and the active site of the flexizyme is not specific to a targeted amino acid. In at least one exemplary embodiment, the T-box element of the artificial ribozyme comprises a mutation that is complimentary to an anticodon of the tRNA substrate and comprises a tyrS T-box. Additionally, the mutation may be at the specifier loop of the T-box element, the tRNA substrate may comprise an amber suppressor tRNA, and the first amino acid may comprise an unnatural amino acid.

In at least one embodiment of the methods described herein, at least the steps of aminoacylating the tRNA substrate and delivering the first aminoacylated tRNA substrate to a ribosome may be performed in vitro. Optionally, there, the step of delivering the first aminoacylated tRNA substrate to a ribosome may comprise exposure to one or more reagents for conducting in vitro translation. Additionally or alternatively, the T-box element may comprise SEQ ID NO: 3 or a functional equivalent thereof.

Such methods may further comprise the steps of: providing a second amino acid; aminoacylating the tRNA substrate with the second amino acid to produce a second aminoacylated tRNA substrate by combining the artificial ribozyme with the second amino acid such that the active site of the flexizyme binds with the second amino acid, combining the artificial ribozyme with the tRNA substrate such that the specifier loop of the T-box element binds, with specificity, an anticodon on the tRNA substrate and a 3′-terminus of the tRNA substrate binds the acceptor end of the flexizyme, and reacting the bound tRNA substrate with the bound second amino acid; and delivering the second aminoacylated tRNA substrate to a ribosome.

BRIEF DESCRIPTION OF THE SEQUENCE LISTING

SEQ ID NO: 1 is an RNA sequence of at least one exemplary embodiment of a ribozyme according to the present disclosure, such ribozyme having tRNA synthetase activity:

5′-GAGUCGCGAUGACGGAUCAAUAGUAGUUAACCCUCUCUUCCGAAG CGAGCCGGGGGCGGUGGGAGCCCGGUGAAGACGGUUAAUGAAACGGCA GUCCGGAGCGAACAUGACGAAAGUGGGUGCGCGUUUGGCGCAUCAAGA UCCCCGCAUCCCCGAAAGGGUACAUGGCGUUAGGUAAAAAAAAGGGAU C-3′; and

SEQ ID NO: 2 is an artificial sequence of at least one exemplary embodiment of a circularly permuted flexizyme according to the present disclosure, such flexizyme comprising an active site for binding an unnatural amino acid and able to charge a tRNA molecule with the unnatural amino acid:

5′-GAUCCCCGCAUCCCCGAAAGGGUACAUGGCGUUAGGUAAAAAAAA GGGAUC-3′; and

SEQ ID NO: 3 is an artificial sequence of at least one exemplary embodiment of a ribozyme according to the present disclosure, such ribozyme having tRNA synthetase activity with an amber suppressor tRNA substrate, and capable of functioning in an in vitro translation system:

5′-GUGGUAAAGAUUGAGACAAGUAGAAUAUCCUUACGUUCCAGAGAG CUGAUGGCCGGUGAAAAUCAGCACAGACGGAUAUAUCGAAUAGACUCA UGAACCGGAUCCCCGCAUCCCCGAAAGGGUACAUGGCGUUAGGUAAAA AAAAGGGAUC-3′.

In addition to the foregoing, a written Sequence Listing for the above-described artificial sequences is appended hereto and the same Sequence Listing is provided in computer readable form encoded in a file filed herewith and herein incorporated by reference. The information recorded in computer readable form is identical to the written Sequence Listing provided herein, pursuant to 37 C.F.R. § 1.821(f).

BRIEF DESCRIPTION OF THE DRAWINGS

The disclosed embodiments and other features, advantages, and aspects contained herein, and the matter of attaining them, will become apparent in light of the following detailed description of various exemplary embodiments of the present disclosure. Such detailed description will be better understood when taken in conjunction with the accompanying drawings, wherein:

FIG. 1A illustrates the binding of a tRNA and an activated amino acid (dintrobenzyl ester (DBE) by a flexizyme;

FIG. 1B shows a gel displaying the results of charging a tRNA with a variety of amino acids using the flexizyme of FIG. 1A;

FIG. 2 illustrates a T-box riboswitch binding a tRNA, with subpart (A) illustrating that when the tRNA is charged with an amino acid, there is no transcription and the gene is turned off and subpart (B) illustrating that when the tRNA is uncharged, it interacts with the antiterminator helix and the gene is turned on;

FIG. 3A illustrates the recognition of the anticodon and CCA tail of the tRNA by the T-box RNA;

FIG. 3B illustrates a model of interaction of the elements of at least one embodiment of a ribozyme of the present disclosure;

FIG. 4A shows the sequence and secondary structure model of at least one exemplary embodiment of the ribozyme comprising SEQ ID NO: 1 of FIG. 3B;

FIGS. 4B and 4C show the sequences and secondary structure models of an unmodified dFx flexizyme module (FIG. 4B) and at least one embodiment of the circularly permuted flexizyme (FIG. 4C) disclosed herein, comprising SEQ ID NO: 2, and used to generate the ribozyme of FIGS. 3B and 4A;

FIG. 4D shows a three-dimensional model of the ribozyme of FIG. 4A bound to a cognate tRNA (tRNA^(Gly) _(GCC)), with the flexizyme module comprising a Fx3 flexizyme and the T-box comprising a glyQS T-box;

FIGS. 4E and 4F illustrates a comparison between the secondary structure models of a glsQS T-box (FIG. 4E) and the ribozyme of FIG. 4A (FIG. 4F);

FIG. 4G illustrates a representative model of an exemplary flexizyme of the present disclosure bound with a cognate tRNA, as well as additional detail on the components thereof;

FIG. 5 show results of a single-turnover kinetics study for dFx, with subpart (A) showing the 3,5-dinitrobenzyl ester (DBE)-activated alkynyl amino acid substrate used in the study; subpart (B) showing a representative gel image showing the accumulation of the slower-migrating band (charged tRNA^(Gly) _(GCC)) over time (gel stained with SYBR Green II); subpart (C) showing a graph of a reaction progress curve for dFx against tRNA^(Gly) _(GCC) U73A mutant (open square), tRNA^(Gly) _(GCC) (open diamond), or tRNA^(Ile) _(GAU) (open triangle);

FIG. 6 illustrates the sequences and secondary structures of tRNA constructs used in connection with the studies described in the present disclosure, with mutations highlighted with boxes;

FIG. 7 shows the results of aminoacylation assays using dFx (subpart A) and STARzyme (subpart B); controls provided that have intentional omission of the following reaction components: no dFx: dFx was omitted from the aminoacylation reaction; no alkynyl-aa: the DBE-activated alkyne-containing amino acid was omitted from the aminoacylation reaction; no azide: the Alexa Fluor 488 picolyl azide was omitted from the click reaction; full rxn: all reaction components were present; full rxn+centricon: same as “Full rxn,” except that the reaction sample was allowed to pass through an Amicon Ultra centrifugal filter unit with a 10 k NMWL cutoff to remove excess dye;

FIG. 8 shows the sequences and secondary structure models of possible flexizyme module variants that may be used in connection with embodiments of the ribozyme of the present disclosure;

FIG. 9 shows the results of studies on the binding specificity and catalytic activity of at least one exemplary embodiment of the ribozyme of the present disclosure, with subpart (A) showing four in vitro gel shift assays representative of the results with those on the left showing that the increasing T-box RNA or STARzyme concentration (“STARzyme” described in additional detail below) correlates with the decreasing band intensity corresponding to tRNA^(Gly) _(GCC), and the gels on the right showing that the band corresponding to the noncognate tRNA^(Ile) _(GAU) did not change with T-box RNA or STARzyme; subpart (B) showing reaction progress curves for STARzyme (STAR-A8-minus1) against tRNA^(Gly) _(GCC) U73A (open square), tRNA^(Gly) _(GCC) (open diamond) or tRNA^(Ile) _(GAU) mutant (open triangle); subpart (C) showing reaction progress curves for STARzyme against tRNA^(Ile) _(GCU) (solid diamond) or tRNA^(Ile) _(GCC) mutant (solid triangle) in comparison with tRNA^(Gly) _(GCC) U73A mutant and tRNA^(Ile) _(GAU); subpart (D) showing reaction progress curves for STARzyme G88A against tRNA^(Gly) _(GCC) U73A (open square) or tRNA^(Gly) _(GUC) U73A (solid square) (error bars stand for standard deviations from three independent experiments); subpart (E) showing two aminoacylation assays representative of the results (top) and a flow chart graphically showing that there was no activity identified of the mutant STARzyme-tRNA pair when the specifier sequence was mutated to be complementary to an amber suppressor tRNA and the anticodon loop of the gly tRNA was mutated to comprise an amber suppressor sequence; and subpart (F) shows a illustrative representation of the methodology of the in vitro assays performed;

FIG. 10 shows a flow-chart representative of at least one embodiment of a method for engineering and/or producing the ribozyme of FIG. 4A;

FIG. 11 shows a flow-chart representative of at least one embodiment of a method for incorporating an unnatural amino acid into a growing polypeptide chain (i.e. a protein) using the ribozyme of FIG. 4A

FIG. 12, subpart (A) shows the representative structure of a wild-type tyrS T-box and subpart (B) shows the representative structure of a tyrS T-box mutated to accept an amber suppressor tRNA and comprising SEQ ID NO: 3;

FIG. 13A shows a diagram illustrative of a method for producing a protein using the mutated tyrS T-box of FIG. 12 and FIG. 13B illustrates at least one embodiment of an in vitro expression system that may be utilized in performing one or more steps of such method;

FIG. 14 shows a gel image resulting from an in vitro aminoacylation assay performed using at least one variation of a ribozyme of the present disclosure comprising the mutated tyrS T-box of FIG. 12;

FIG. 15A illustrates a representation of full-length DHFR protein. The box marked TAG (I 115) indicates the position in the gene where a stop codon was introduced;

FIG. 15B shows the result of the in vitro translation assay using (a) a WT DHFR template, (b) no template, (c) the WT template in the presence of AMB-STAR and an amber suppressor tRNA, and (d) the mutant template in the presence of one variation of a ribozyme of the present disclosure, the artificial amino acid, and an amber suppressor tRNA;

FIGS. 16A-16C illustrate that DHFR protein can be synthesized in the presence of the artificial amino acid, at least one variation of a ribozyme of the present disclosure (e.g., a ribozyme comprising SEQ ID NO: 3), and a suppressor tRNA; namely, FIG. 16A shows an in vitro translation reaction in the presence or absence of all of the components required for incorporation of unnatural amino acids into a protein (presence or absence of components in each lane indicated in chart above gel, with a lack of such component indicated by a

symbol); FIG. 16B shows a schematic indicating that the artificial tRNA containing the amber suppressor can base pair to the STOP codon in the message RNA while naturally occurring E. coli tRNAs cannot; and FIG. 16C shows that removing release factor 1 (RF1) from the reaction allows translation through the STOP codon and generation of full length (FL) protein;

FIG. 17A illustrates predicted DHFR fragments generated by trypsin digestion;

FIG. 17B illustrates the predicted charge to mass ratio when various natural and the unnatural amino acid are incorporated into the peptide;

FIG. 17C illustrates incorporation of the unnatural amino acid at position 105 in the presence of the one variation of a ribozyme of the present disclosure, while the natural proline amino acid is detected in the wild type protein;

FIG. 18A illustrates a representation of a full-length DHFR protein, with the box marked TAG (L54) indicative of the position in the gene where a stop codon was introduced;

FIG. 18B illustrates the predicted charge to mass ratios of the peptide fragment containing amino acid 54 when the wild type, unnatural amino acid, or alternate amino acids are incorporated at position 54;

FIG. 18C illustrates the predicted results when a WT template, L54AMB (stop codon) template, or L54AMB (stop codon) template plus STARzyme are used in at least one embodiment of the in vitro translation system of the present disclosure; and

FIGS. 18D-F show graphical results that support peptides containing the artificial amino acid of interest are only observed when both the L54AMB template and a flexizyme of the present disclosure are both present within the reaction.

While the present disclosure is susceptible to various modifications and alternative forms, exemplary embodiments thereof are shown by way of example in the drawings and are herein described in detail.

DETAILED DESCRIPTION

For the purposes of promoting an understanding of the principles of the present disclosure, reference will now be made to the embodiments illustrated in the drawings and specific language will be used to describe the same. It will nevertheless be understood that no limitation of scope is intended by the description of these embodiments. On the contrary, this disclosure is intended to cover alternatives, modifications, and equivalents as may be included within the spirit and scope of this application as defined by the appended claims. As previously noted, while this technology may be illustrated and described in one or more preferred embodiments, the compositions, systems and methods hereof may comprise many different configurations, forms, materials, and accessories.

In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present disclosure. Particular examples may be implemented without some or all of these specific details and it is to be understood that this disclosure is not limited to particular biological systems, which can, of course, vary.

Various techniques and mechanisms of the present disclosure will sometimes describe a connection or link between two components. Words such as attached, linked, coupled, connected, and similar terms with their inflectional morphemes are used interchangeably, unless the difference is noted or made otherwise clear from the context. These words and expressions do not necessarily signify direct connections, but include connections through mediate components and devices. It should be noted that a connection between two components does not necessarily mean a direct, unimpeded connection, as a variety of other components may reside between the two components of note. Consequently, a connection does not necessarily mean a direct, unimpeded connection unless otherwise noted.

Furthermore, wherever feasible and convenient, like reference numerals are used in the figures and the description to refer to the same or like parts or steps. The drawings are in a simplified form and not to precise scale. It is understood that the disclosure is presented in this manner merely for explanatory purposes and the principles and embodiments described herein may be applied to devices and/or system components that have dimensions/configurations other than as specifically described herein. Indeed, it is expressly contemplated that the size and shapes of the composition and system components of the present disclosure may be tailored in furtherance of the desired application thereof.

Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of skill in the relevant arts. Although any methods and materials similar to or equivalent to those described herein can be used in the practice or testing of the subject of the present application, the preferred methods and materials are described herein. Additionally, as used in this specification and the appended claims, the singular forms “a”, “an” and “the” include plural referents unless the content clearly dictates otherwise. Thus, for example, reference to “a tRNA” includes a combination of two or more tRNAs; reference to “bacteria” includes mixtures of bacteria, and the like.

As used herein, the term “orthogonal” refers to a molecule (e.g., an orthogonal tRNA and/or an orthogonal aminoacyl tRNA synthetase) that is intended to function with endogenous components of a cell with reduced efficiency as compared to a corresponding molecule that is endogenous to the cell or translation system, or that fails to function with endogenous components of the cell.

As used herein, the term “cognate” means components that function together, for example, a tRNA and aminoacyl tRNA synthetase that preferentially aminoacylates that tRNA. These components can also be referred to as being “complementary.”

As used herein, the phrase “selector codon” refers to a codon recognized by an orthogonal tRNA in a translation process that is not typically recognized by an endogenous tRNA. Typical examples include stop codons, codons comprising four (4) or more bases, and/or the like. An orthogonal-tRNA anticodon loop recognizes a selector codon, e.g., in an mRNA, and inserts its amino acid into a polypeptide being translated by translation system components. For example, in at least one embodiment herein, the tRNA recognizes a selector codon such as an amber STOP codon and adds an uAA into a polypeptide being produced by the translation process. Selector codons can include, for example and without limitation, nonsense codons (such as STOP codons, e.g., amber, ochre, and opal codons), four (4) or more base codons, rare codons, codons derived from natural or unnatural base pairs, and/or the like.

As used herein, the phrase “suppressor tRNA” is a tRNA that alters the reading of a mRNA in a given translation system, for example, by providing a mechanism for incorporating an amino acid into a polypeptide chain in response to a selector codon. By way of a non-limiting example, a suppressor tRNA can read through a stop codon, a four (4) base codon, a rare, codon, etc.

As used herein, the phrase “translation system” refers to the components that incorporate an amino acid into a growing polypeptide chain (protein). Components of the translation system can include, for example, ribosomes, tRNAs, synthetases, mRNA, and the like. The novel tRNA and/or tRNA synthetases of the present disclosure can be added to or be part of an in vitro or in vivo translation system, for example, in a non-eukaryotic cell (e.g., a bacterium such as E. coli) or in a eukaryotic cell (e.g., a mammalian cell, a plant cell, a yeast cell, an algae cell, a fungus cell, an insect cell, and/or the like) or an organism.

As used herein, an “in vitro translation kit” refers to one or more components of a commercially available kit that contains most, if not all, of the reagents necessary to express proteins from DNA or RNA templates in a cell-free solution.

As used herein, the phrase “unnatural amino acid” or related abbreviation “uAA” refers to any amino acid, modified amino acid, and/or amino acid analogue that is not one of the twenty (20) common naturally occurring amino acids or the rare natural amino acids.

As used herein, the phrase “derived from” refers to a component that is isolated from or made using a specific molecule or organism, or information from the specific molecule or organism.

“A functional equivalent” of a sequence means RNA that has at least 50% homology with the RNA described by the sequences set forth herein and that have essentially the same properties, structure, and functionality. A number of studies have demonstrated that the functional equivalents of RNA sequences can be prepared by maintaining the base pairing of most of the double helical regions even when changes are made to the stems. Changes in the stem sequence does not significantly affect secondary structure, substrate specificity, and/or the functionality of the underlying structure; therefore, 100% sequence identity is not required in all cases to achieve the desired structure and functionality of the resulting molecule. Here, preferably, the homology amounts to 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 97%, 98%, 99% or more and maintains the structure and functionality of the original.

As used herein, the term “encodes” refers to any process whereby the information in a polymeric macromolecule or sequence string is used to direct the production of a second molecule or sequence string that is different from the first molecule or sequence string. As used herein, the term is used broadly and can have a variety of applications. In one aspect, the term “encode” describes the process of semi-conservative DNA replication, where one strand of a double-stranded DNA molecule is used as a template to encode a newly synthesized complementary sister strand by a DNA-dependent DNA polymerase. In another aspect, the term “encode” refers to any process whereby the information in one molecule is used to direct the production of a second molecule that has a different chemical nature from the first molecule. For example, as is well known in the art, a DNA molecule can encode an RNA molecule (e.g., by the process of transcription incorporating a DNA-dependent RNA polymerase enzyme). Also, an RNA molecule can encode a peptide, as in the process of translation. When used to describe the process of translation, the term “encode” also extends to the triplet codon that encodes an amino acid. In some aspects, an RNA molecule can encode a DNA molecule, e.g., by the process of reverse transcription incorporating an RNA-dependent DNA polymerase. In another aspect, a DNA molecule can encode a polypeptide, where it is understood that “encode” as used in that case incorporates both the processes of transcription and translation.

As used herein, the term “riboswitch” refers to a regulator segment of a mRNA molecule commonly found in the 5′-untranslated region, that binds another molecule (e.g., metabolites or metal ions as ligands) and regulates mRNA or protein expression by forming alternative structures in response to this ligand binding.

As used herein, the terms “T-box” and “T-box RNA” refer to a naturally occurring riboswitch that recognizes a cognate tRNA through interaction with the anticodon loop of that tRNA and can control gene expression (i.e. toggles it on or off) based on whether the aminoacyl group on the tRNA is present, i.e. the tRNA is charged. Generally, a T-box comprises three stem-loop structures that fold into a three-dimensional structure similar to that shown in FIG. 4D (200), and includes a three nucleotide sequence in a specific location designated the specifier sequence that recognizes the anticodon of the tRNA.

As used herein, the term “leader” refers to a wild-type or variant form of the upstream portion of a bacterial gene, particularly a bacterial gene that is regulated by a T-box termination/antitermination complex, and that comprises the elements of a T-box termination/antitermination complex, including a specifier sequence, a T-box, and conserved terminator and antiterminator consensus motifs.

“glyQS,” as used herein, refers to either a DNA molecule that comprises all or a portion of a bacterial glyQS gene (including the promoter and leader regions), and optionally all or a portion of the polynucleotide coding sequence encoding a wild-type or variant bacterial glycyl-tRNA synthetase, or an mRNA product encoded by the same.

“tyrS,” as used herein, refers to either a DNA molecule that comprises all or a portion of a bacterial tyrS gene (including the promoter and leader regions), and optionally all or a portion of the polynucleotide coding sequence encoding a wild-type or variant bacterial tRNA synthetase (i.e. tyrosyl-tRNA synthetase (TyrTS), or an mRNA product encoded by the same.

“Gram positive bacteria,” as used herein, means the phylogenetic group of bacteria commonly known as and referred to as the Gram-positive branch.

As used herein, the term “ribozyme” refers to RNA molecules that are capable of catalyzing specific biochemical reactions. Within the ribosome, ribozymes activities function as part of the large subunit ribosomal RNA to link together amino acids during protein synthesis to form protein chains.

As used herein, the term “flexizyme” refers to an artificial ribozyme that catalyzes the aminoacylation of tRNA with a chemically synthesized amino acid substrate. Flexizymes can accept a wide variety of amino acid substrates with diverse functional groups, opposite chirality, etc., but do not conventionally recognize tRNA with any specificity.

In order to generate a protein that contains a synthetic or unnatural amino acid (uAAs) therein, as previously described, conventional methods require the use of orthogonal pairs of an aminoacyl tRNA (ARS) and a tRNA that can function efficiently in the translational machinery, but that are “orthogonal” to the translation system at issue, meaning that the pairs function independently of the synthetases and tRNAs endogenous to the translation system itself. Desirable characteristics of the orthologous pair include tRNA that decode or recognize only a specific new codon, e.g., a selector codon, that is not decoded by any endogenous tRNA, and an ARS that preferentially aminoacylates (or charges) its cognate tRNA with only a specific amino acid. The orthogonal tRNA is also desirably not aminoacylated by endogenous synthetases. Generating such orthogonal pairs requires extensive engineering and often results in low or unsuccessful yields, especially in vivo in eukaryotes.

Here we report a novel artificial ribozyme molecule that is capable of both discriminating between tRNA molecules and effectively charging a cognate tRNA with an uAA in vitro and potentially in vivo in prokaryotes, archaea, and eukaryotes alike. To achieve this, the compositions and methods of the present disclosure uniquely combine the highly promiscuous nature of flexizymes with respect to amino acids and a tRNA-recognition module from T-box RNA. The resulting ribozyme results in a composition that can encode amino acids with specified codons in vivo without the use of conventional orthogonal pairs, adds specificity to traditional flexizyme characteristics in that it can both aminoacylate—or charge—tRNA with a wide variety of amino acid substrates (including chemically synthesized uAAs), does not require reengineering if a different amino acid substrate is used, and also exhibits significant specificity for a cognate tRNA (with determinants for tRNA recognition involving both the discriminator base and the anticodon of tRNA). Moreover, because it is an RNA molecule, the inventive ribozyme hereof can be delivered to a cell on a single plasmid along with its cognate tRNA substrates. As such, there is no need to translate the catalyst into a protein as is commonly seen in conventional systems. Site-specific incorporation of uAAs is a powerful tool in biological research and, as such, the inventive systems and methods provided herein that are not limited in terms of the chemical diversity of the uAAs which they can incorporate are a substantial improvement over conventional systems and methods in this field.

Now referring to FIG. 1A, a conventional flexizyme 10 bound to both a tRNA 12 and an amino acid 14 (here, amino acid dintrobenzyl ester) is shown for comparative purposes. Generally, a flexizyme 10 is an artificial ribozyme that catalyzes the aminoacylation of tRNA with a uAA or other amino acid 14. In relevant part, flexizymes 10 generally have a core composed of one irregular and three A-form helices, with the irregular helix comprising an active site 18 (or moiety binding pocket) for interacting with and/or binding amino acids 14. A hairpin bend near the 3′-terminus of the flexizyme 10 forms an acceptor stem 17 that allows the terminal three (3) nucleotides at the acceptor end 19 (GGU) to protrude away from the helical stack and makes them available for base pairing to the cytosine-cytosine-adenine (CCA)-terminus 16 of an tRNA substrate 12 (or other CAA sequences as described below).

The activity of flexizymes is not restricted by the functional group of the amino acid substrate, but, as stated above, its acceptor end 19 binds and acts on the CCA sequence 16 at the 3′-terminus end of the tRNA substrate 12. As CCA sequences 16 at the 3′-terminus end is a feature consistent across tRNA substrates, flexizymes 10 lack the ability to discriminate a cognate tRNA from noncognate tRNA. Accordingly, flexizymes 10 aminoacylate any tRNA substrate 12 with which it comes into contact. While this highly promiscuous nature is advantageous in that flexizymes can accept a wide variety of amino acid substrates with diverse functional groups and subsequently aminoacylate a tRNA substrate 12 with the same, it ultimately renders flexizymes 10 ineffective for cotranslationally incorporating uAAs (or other amino acids) into proteins using in vitro or in vivo translation systems. Indeed, for such systems to be effective, the aminoacyl synthetase used (or ARS-like ribozyme) must specifically bind one particular tRNA (such as, for example, an amber suppressor tRNA).

T-box RNAs 20, on the other hand, are naturally occurring riboswitches that can recognize a cognate tRNA through interaction with the anticodon loop 22 of the tRNA 12. Examples of the T-box mechanism are illustrated in FIG. 2, subparts (A) and (B), shown bound to tRNA 12 (both aminoacylated (subpart (A)) and uncharged (subpart (B))) for reference. The T-box mechanism regulates the expression of aminoacyl tRNA synthetase (ARS)-related genes in gram-positive bacteria. In this system, a riboswitch element in the upstream or “leader” region 24 of the nascent transcript monitors the relative amounts of charged versus uncharged species of a specific tRNA through direct binding of tRNA 12 by the leader RNA 24 at the specifier loop 26 thereof. Perhaps more specifically, binding a specific uncharged tRNA to a riboswitch element causes a structural change in the transcript that promotes expression of the downstream coding sequence.

As shown in subpart (B) of FIG. 2, when bound, uncharged tRNA 12 interact at both the specifier loop 26 (via anticodon loop 22 of the tRNA 12) and, at its acceptor end 27, with the bulged region in the antiterminator 30 as shown in subpart (B) of FIG. 2. This dual interaction stabilizes the antiterminator 30 and prevents the formation of a competing terminator helix, thereby enabling transcription to read through the termination site and into the downstream coding sequence. Conversely, when an aminoacylated tRNA 12 binds the T-box RNA 20, the presence of the amino acid 14 prevents interaction of the acceptor end 27 of the tRNA 12 with the antiterminator 30, thus ending transcription.

Unlike flexizymes, T-box RNA exhibit tRNA specificity at both of its binding sites and, as such, recognize the anticodon 22 and the CCA tail of cognate tRNA 12. Indeed, recognition of the tRNA 12 by the leader 24 may mimic recognition by an aminoacyl tRNA synthetase, which often exploits the anticodon 22 and discriminator 15 positions as specificity determinants.

FIG. 3A illustrates an alternative view of an uncharged tRNA 12 bound with a T-box RNA 20 that shows additional detail of the tRNA acceptor end 27 and T-box antiterminator 30 interaction. As shown in FIG. 3A, if the tRNA 12 is uncharged, the CCA-tail 16 at the 3′-terminus end of the bound tRNA 12 can base pair with the UGG sequence 32 upstream of the T-box riboswitch in a manner reminiscent of the interaction between the flexizyme 10 and a CCA-tail 16 of its substrate tRNAs 12 (see FIG. 1A). Notably, FIG. 3A also illustrates discriminator base 28, which will be referenced in further detail below in connection with T-box RNA 20 specificity.

Expanding from this identified interaction and its similarities with conventional flexizymes, the concept of building the catalytic activity of a flexizyme into T-box RNA was developed. In at least one embodiment of the present disclosure, a fusion ribozyme 300 is provided. FIG. 3B provides a high-level view of the conceptual interaction between the components of the fusion ribozyme 300. Generally, the fusion ribozyme 300 comprises catalytic/flexizyme module 10 linked and/or fused together with a T-box RNA module 20. The catalytic/flexizyme module 10 comprises a molecule based on a flexizyme and, as described herein, will hereinafter be referred to as the “flexizyme module.”

Both the flexizyme and T-box modules 10, 20 are capable of interacting with a tRNA 12, with the flexizyme module 10 catalyzing the aminoacylation thereof with a desired amino acid substrate 14 and the T-box module 20 providing a degree of tRNA 12 specificity. For example and as previously described, the acceptor end 19 of the flexizyme module 10 interacts with a CCA-tail 16 of a tRNA 12 and the T-box module 20 comprises at least a tRNA binding domain (specifier loop) 26 capable of interacting with the anti-codon 22 of a cognate tRNA substrate 12. Furthermore, as the flexizyme module 10 also comprises an active site 18 for binding an amino acid 14 for aminoacylation of the tRNA substrate 12. In this manner, when the fusion ribozyme 300 hereof is used in protein engineering and cotranslation techniques, it can select a specific tRNA 12 and charge it with a wide variety of amino acids 14 (artificial or otherwise) without any reengineering.

Various embodiments of the fusion ribozyme 300 are provided herein, the structures and variations of which will now be described in additional detail. Now referring to FIG. 4A, at least one exemplary embodiment of a ribozyme 400 is shown. The ribozyme 400 comprises a flexizyme module 100 fused with a T-box module 200 as described in connection with fusion ribozyme 300. While the T-box module 200 provides specificity to a particular tRNA substrate, the flexizyme module 100 has no specificity to side chains of uAA substrates and, as such, the novel ribozyme 400 hereof can put various uAAs to a specific tRNA.

FIG. 4A shows that the T-box and flexizyme modules 200, 100 of the ribozyme 400 are linked through the P1 stem of the flexizyme module 100. The resulting ribozyme 400 exhibits both exemplary tRNA-synthetase activity and significant specificity for a cognate tRNA. While the flexizyme module 100 of the ribozyme 400 may comprise any suitable type of flexizyme 100 now known or hereinafter developed (including, for example and without limitation, Fx3 (which has three bases) or any of its variants (e.g., Fx2-10)), in this at least one exemplary embodiment, the flexizyme module 100 starts with dinitro-flexizyme (dFx). dFx is a catalytic RNA that attaches an activated amino acid 14 to the 3′-CCA (or −0H) tail of a tRNA substrate 12. dFX uses its terminal 5′-GGU-3′ sequence to base-pair with the 5′-ACC-3′ sequence in tRNA from positions 73 to 75, which is particularly appropriate for sequences with A73 as the discriminator base. Accordingly, dFx is effective at aminoacylating virtually all tRNA species. Notwithstanding the foregoing, as noted above, it is understood that other artificial catalytic RNA may be used as the flexizyme 100 hereof, provided such catalytic RNA have the ability to aminoacylate various tRNAs with one or more amino acids and/or unnatural derivatives of such amino acids.

To create the ribozyme 400 of the present disclosure, the crystal structure of a T-box riboswitch 200 bound to a cognate tRNA 12 is employed, and the acceptor stem 27 of the bound tRNA 12 is superposed with acceptor stem analog 417 present in the crystal structure of the flexizyme 100, for example, Fx3, dFx, eFx, aFx, or the like. As described herein, the particular embodiment of the ribozyme 400 wherein the flexizyme module 100 comprises dFx is termed “STARzyme,” which is representative of the name “specific tRNA aminoacylating ribozyme” and comprises SEQ ID NO: 1 or a functional equivalent thereof, including eFx and aFx.

Notably, for the ribozyme 400 to be most effective at recognizing the anticodon 22 of a tRNA 12 and performing a chemical reaction at the acceptor stem 417 of the flexizyme 100, the orientation of the T-box module 200 with respect to the flexizyme module 100 should preferably be optimized. Indeed, through several iterative rounds of design, it was determined that a circular permutation of the flexizyme 100 and optimization of both the linker length and length of the connector helix can have a significant effect on the ribozyme's 400 specificity for a cognate tRNA 12. Accordingly, in at least one exemplary embodiment of the ribozyme 400, the flexizyme module 100 comprises a circular permutation of dFx (SEQ ID NO: 2). In such embodiments, the circularly permuted dFx module 100 may comprise seven (7), eight (8), or ten (10) nucleotide linkers 450 (see FIG. 4F). In the exemplary STARzyme embodiment, the flexizyme module 100 comprises a circular permutation of dFx and an eight (8) nucleotide linker.

FIG. 4A illustrates detail regarding the structure of a ribozyme 400, in at least one embodiment where the T-box module 200 comprises a tRNA^(Gly) _(GCC) specific T-box (glyQS T-box) from Geobacillus kaustophilus (Gkau) and the flexizyme module 100 comprises a circularly permuted version of the aminoacylating ribozyme dFx. While certain bacterial genera are referenced and/or described in connection with particular embodiments of the T-box of the present disclosure, one skilled in the art will appreciate that this is for illustrative purposes only and not intended to be limiting.

The Gkau glyQS T-box has a relatively minimal structure, is thermodynamically stable, and the crystal structure is readily available for reference; all characteristics that are beneficial in the current context. Notwithstanding the foregoing example, it will be understood that while dFx and the Gkau glyQS T-box (and other flexizymes and/or T-box RNAs) are used herein to describe the various embodiments of the present disclosure, such examples are not intended to be limiting. For example, in at least one alternative embodiment, the T-box module 200 comprises a tyrS T-box. Indeed, any appropriate flexizyme and/or T-box RNA may be used in connection with the novel ribozymes and related methods of the present disclosure, provided that, in at least certain embodiments, the T-box riboswitch and flexizyme selected do not bump or overlap when fused as described herein. This may be tested and/or confirmed with 3D modeling, activity assays, and/or similar techniques. It will be appreciated that, unless otherwise expressly set forth herein, the use of a particular type of T-box in this description is only by way of example and not intended to be limiting.

For comparison purposes, FIGS. 4B and 4C, respectively, show the flexizyme 100 (here, dFx, with FIG. 4C comprising SEQ ID NO: 2 or a functional equivalent thereof) in its linear and circular permutated versions (the catalytic activity remains in both versions which supports that, despite the inclusion of the polyA chain, the structure of the two permutations appears largely unaffected overall). The T-box RNA 200 (which, for example and without limitation, may comprise a glyQS, tyrosine, or any other type of T-box riboswitch) comprises nucleotides 426 for anticodon recognition (FIG. 4A) and flexizyme 100 comprises nucleotides 428 for binding the CAA tail 16 of a cognate tRNA bound to the T-box module 200 (not shown). In the at least one embodiment shown in FIG. 4A, the sequence GAG was also added to the 5′-end of the ribozyme 400 to facilitate efficient in vitro transcription of the ribozyme by T7 RNA polymerase; it is not required for ribozyme activity.

Finally, flexizyme 100 further comprises an active site 418 for binding the designated amino acid 14 (including, without limitation, a uAA). This model demonstrates that the T-box riboswitch 200 and the flexizyme 100 do not sterically interfere with each other when docked to a tRNA and, as such, support their combined use in an engineered ribozyme 400 with tRNA-synthetase activity (see FIG. 1, subpart (A), in particular).

Now referring to FIG. 4D, a 3D image of a starting model for tRNA recognition by the ribozyme 400 is shown (PDB ID 4 MGN and 3CUL). As per FIG. 4D, the visible portions of the ribozyme 400 comprise stem I of the T-box module 200 (here, glyQS), a bound cognate tRNA 12 (here, tRNA^(Gly) _(GCC)), the flexizyme module 100 (here, Fx3), a specifier loop 426 of the T-box RNA 200, an anticodon 422 of the tRNA substrate 12, and the CAA tail 16 from a superposed acceptor stem analog 417 within the flexizyme 100 structure (with the remainder of the acceptor stem analog 417 structure not shown for the sake of simplicity). At least in this embodiment, stems II and III of the T-box module 200 were not specifically modified, with the spacer region enabling the tRNA to bind both stem 1 and the antiterminator region.

FIGS. 4E and 4F provide a side-by-side comparison of how closely the structure of the fusion ribozyme 400 (here, illustrated as being tRNA^(Gly)-specific) of the present disclosure mimics a glyQS T-box with respect to the antiterminator topology.

FIG. 4G provides a model of the structure of at least one embodiment of ribozyme 400 bound to a cognate tRNA 12, as well as additional detail of the component parts 100, 200 of the ribozyme 400. Subpart (1) of FIG. 4G illustrates the schematic of the tyrS T-box 200 in its natural context as a gene regulatory element (i.e. where it turns the downstream gene on and off depending on the acylation state of the bound tRNA 12), subpart (2) shows a model of the three dimensional structure of the ribozyme 400, and subpart (3) shows the flexizyme 100 as it was initially created.

In support of the novel principles and compositions set forth herein, several studies were performed on the ribozymes 300, 400 hereof to confirm and optimize their novel functionality.

The Isolated dFx Flexizyme Exhibits Specificity for the Discriminator Base of tRNA

To verify that the ribozyme 400 (and, in particular, the isolated flexizyme 100 thereof) exhibits the desired specificity for the discriminator base 28 of the tRNA 12 when the T-box riboswitch 200 and the flexizyme 100 are fused, the catalytic activity of the flexizyme 100 as it relates to specific tRNA substrates 12 was tested in connection with an activated amino acid substrate carrying an alkynyl group (see FIG. 5, subpart (A)). Perhaps more specifically, the observed rate constant (k_(obs)) of aminoacylation reaction on both tRNA^(Gly) _(GCC) and tRNA^(Ile) _(GAU) substrates were measured through single-turnover kinetic assays (see FIG. 5, subparts (B) and (C)) (e.g., mix 10 μM of refolded ribozyme 400 with 2 μM of refolded tRNA; add saturating akynyl-DBE to initiate the reaction; incubate the reaction at 4° C. and take aliquots at desired time points; and perform acid gel electrophoresis followed by SYBR-green II staining). The sequence and secondary structure of the tRNA substrates used are shown in FIG. 6, with mutations (where applicable) highlighted by boxes.

The results indicate that the flexizyme 1000 (here, dFx) charged tRNA^(Gly) _(GCC) with k_(obs)=0.2±0.040 h⁻¹. In comparison, the flexizyme 100 charged tRNA^(Ile) _(GAU) with k_(obs)=0.6±0.080 h⁻¹ (see Table 1), a 3-fold, and statistically significant, difference.

TABLE 1 Kinetic parameters for dFx and STARzymes. Parameters reported herein represent average values from the three independent measurements. Errors stand for standard deviations. k_(obs) (h⁻¹), f_(max), tRNA^(Gly) _(GCC) tRNA^(Gly) _(GCC) k_(obs), k_(obs), Ribozyme U73A U73A k_(obs) (h⁻¹), f_(max), k_(obs) (h⁻¹), f_(max), cognate/k_(obs), cognate/k_(obs), construct (cognate) (cognate) tRNA^(Ile) _(GAU) tRNA^(Ile) _(GAU) tRNA^(Gly) _(GCC) tRNA^(Gly) _(GCC) tRNA^(Ile) _(GAU) tRNA^(Gly) _(GCC) dFx 0.54 ± 0.082 0.75 ± 0.016 0.60 ± 0.080 0.80 ± 0.047  0.20 ± 0.041  0.86 ± 0.0099 0.90 2.7 STAR-A8 0.21 ± 0.015  0.57 ± 0.0077 0.055 ± 0.0032 0.50 ± 0.015 0.067 ± 0.015 0.50 ± 0.15  3.8 3.1 STAR-A8-  0.16 ± 0.0066  0.55 ± 0.0061 0.047 ± 0.0094 0.40 ± 0.044 0.065 ± 0.017 0.44 ± 0.072 3.4 2.5 minus1 STAR-A10 0.17 ± 0.048 0.55 ± 0.021 0.090 ± 0.029  0.52 ± 0.13  0.085 ± 0.012 0.45 ± 0.041 1.9 2.0 STAR-A7 0.092 ± 0.015  0.49 ± 0.095 0.026 ± 0.0061 0.46 ± 0.15   0.034 ± 0.0061 0.52 ± 0.057 3.5 2.7

It is known that aminoacylation is most productive when the discriminator base 28 at position 73 immediately preceding the CCA-tail of the tRNA 12 is complementary to the 3′-terminal U of Fx3 (which is equivalent to U46 in dFx). tRNA^(Ile) _(GAU) has a discriminator base A and can form a Watson-Crick base pair with U46 of dFx, while tRNA^(Gly) _(GCC) has as U at the discriminator base position 28 and therefore cannot make a base pair with U46 of dFx when used as the flexizyme 100. Accordingly, the variation between the k_(obs) values observed in connection with the specificity tests is consistent with the previous study where aminoacylation yields were compared between tRNAs having different discriminator bases 28.

In view of this, additional tests were performed to ascertain if the reactivity variation observed between the two tRNAs was accurately attributable to the difference in discriminator base 28 or some other factor. In furtherance of this, a U73A mutation was introduced into tRNA^(Gly) _(GCC) (see FIG. 6). This mutation rescued the ribozyme activity (k_(obs)=0.54±0.080 h⁻¹ for tRNA^(Gly) _(GCC) U73A mutant versus k_(obs)=0.60±0.080 h⁻¹ for tRNA^(Ile) _(GAU)), thus indicating that U-to-A mutation restores flexizyme 100 activity (at least when the flexizyme comprises dFx) and supporting discriminator base 28 specificity.

FIG. 5, subpart (C) displays reaction progress curves for dFx against tRNA^(Gly) _(GCC) U73A mutant (open square), tRNA^(Gly) _(GCC) (open diamond), or tRNA^(Ile) _(GAU) (open triangle), as does Table 1. As maximal activity of the fusion ribozyme 400 is expected to require Watson-Crick base-pairing at both the discriminator base 28 and the anticodon loop 22 of the tRNA, the tRNA^(Gly) _(GCC) U73A mutant was used as the cognate tRNA for the remaining studies.

tRNA Binding Activity of the T-box and Fusion Ribozymes

To characterize the tRNA binding properties of the T-box RNA 200 and the fusion ribozymes 400 of the present disclosure, native gel band shift assays were performed using a previously established protocol for the Gkau glyQS T-box riboswitch 200. Isolated Gkau glyQS T-box riboswitch RNA 200 was used as a positive control (labeled as “T-box”) and tRNA^(Ile) _(GAU) was used as a negative control (labeled as “tRNA^(Ile)”). A linear ribozyme 400 was created by connecting the 3′-terminus of the T-box module 200 to the 5′-terminus of the dFx module 100 via a 5-to 8-nt poly-A linker 450 (see FIG. 4F). Secondary structure prediction by Mfold supported correct folding of both modules 100, 200 within the ribozyme 400.

T-box RNA or the T-box-dFx linear ribozyme 400 was added to cognate tRNA^(Gly) _(GCC) A73U or non-cognate tRNA^(Ile) _(GAU). As shown in FIG. 9A, the addition of up to 6-fold molar excess of T-box or fusion RNA gradually shifted the cognate tRNA^(Gly) _(GCC) A73U, thus the band corresponding to the tRNA disappeared as the riboswitch 200 or ribozyme 400 was added. (Note that no new band for the complex of tRNA with the riboswitch 200 or fusion RNA 400 was observed and, while it is not clear as to why a supershift was not observed under these conditions, it is possible the tRNA complex was smeared or comigrated with another band.) Conversely, the results clearly demonstrated that the band corresponding with the noncognate tRNA^(Ile) _(GAU) remained unshifted and did not change in intensity following the addition of the riboswitch 200 or linear fusion 400. These results indicate that the T-box module 200 embedded in the fusion construct 400 is functional and that it provides the fusion ribozyme 400 with tRNA binding specificity.

Circular Permutation of dFx

The catalytic activity of the linear fusion RNA 400 was also explored using the single turnover assay previously described in connection with the dFx-ribozyme 400. While the linear ribozyme 400 was catalytically active, its activity was lower than that observed for dFx (a circular permutation). Further, the linear permutation of the flexizyme 100 did not distinguish the cognate tRNA from noncognate tRNA (data now shown). The tRNA-binding properties and the presence of catalytic activity support that the T-box and flexizyme modules 200, 100 are both folded and functional. However, the lack of discrimination and diminished catalytic activity suggest that this embodiment of the linear flexizyme module 100 cannot simultaneously bind to the CCA-tail 16 of the tRNA 12 when the tRNA 12 is docked to the T-box riboswitch module 200.

Referring back to FIG. 4D, the working 3D model of the ribozyme 400 supports that a circular permutation of the flexizyme 100 (created by linking the 5′- and 3′-terminus ends and opening the P1 loop) provides advantageous topology for fusing the two modules 100, 200 within the ribozyme 400. Although disrupting the structure of the CCA-binding site 19 on the single stranded tail at the 3′-terminus end of the flexizyme 100 is problematic as any structural disruption would be detrimental to ribozyme 400 activity, it was determined that a number of adenosines can be used to successfully link the 5′-terminus of the parental flexizyme molecule 100 with the 3′-terminus containing the CCA-binding site 19 to provide flexibility without deleterious effect (see FIG. 4B). Indeed, at least with respect to dFx flexizymes 100, this insertion did not interfere with formation of the native secondary structure.

Accordingly, in at least one exemplary embodiment of the present disclosure herein referred to as the STARzyme, the 3′-end of the glyQS T-box 200 may connect to the new 5′-end of the circularly permuted flexizyme 100 (here, dFx). As comparatively illustrated in FIGS. 4E and 4F, the overall topology of this embodiment of ribozyme 400 (FIG. 4F) resembles that of the wild-type glyQS T-box 200 (FIG. 4E), where the aptamer domain 30 is connected to a helix (A1 in the glyQS T-box and P1 in STARzyme) that is then linked to a bulge containing the binding motif for the CCA tail.

Detection of the tRNA-Linked Amino Acid

While the activity assay previously described employed a gel mobility assay to monitor the aminoacylation of tRNA, further steps were performed to confirm that the shift in gel mobility was due to the addition of an uAA to the tRNA. To achieve this, covalent modification through ‘click’ chemistry was employed. The tRNA^(Gly) _(GCC) A73U mutant was aminoacylated using a dFx flexizyme 100 (to form the STARzyme as referred to herein) for a period expected to provide the maximal yield of tRNA product. The reaction mixture was treated with Alexa Fluor 488 Picolyl Azide, which contains a fluorophore and an azide group. This reagent specifically targets the aminoacyl group on the tRNA with a fluorescent probe, which specifically labels the tRNAs that have been aminoacylated by the ribozyme 400, while the ribozyme 400 itself and unreacted product remain untouched.

Reactions and controls were divided in two and loaded side-by-side on a single polyacrylamide gel, which was subsequently cut into halves. The first half was stained with SYBR-Green II to allow for visualization of all RNA products and the second half was left unstained. The two half gels were reconstituted prior to scanning.

A fluorescently labeled band was observed at the molecular weight corresponding to the aminoacylated tRNA product only when all of the required components of the aminoacylation reaction were present within the mixture (see FIG. 7, subparts (A) and (B)). When small molecules were removed from the reaction by ultracentrifugation, only a single fluorescently labeled band—again, corresponding to the size of the aminoacylated tRNA product—was observed. This data supports that the ribozyme 400 is in fact catalyzing the addition of uAAs to a tRNA substrate 12. Note that in the SYBR-Green II stained gel, the putative reaction product (i.e. the charged tRNA) migrates higher upon the addition of Alexa Fluor 488-tagged picolyl azide. The identity of the “supershifted” band was further confirmed by the detection of Alexa Fluor 488 fluorescence in the unstained portion of the gel. Upon ultrafiltration, the single fluorescent band (lane #11) on the right half of the gel comigrates with the “supershifted” band on the left half of the gel (lane #4 or lane #5).

Optimizing the Catalytic Activity of the Ribozyme

Various variants of the ribozyme 400 may be engineered to optimize its tRNA specificity and/or catalytic activity. For example, linker 450 size has been found to contribute to tRNA specificity of the ribozyme 400. Accordingly, in at least circular permutations of the flexizyme 100, the inventive ribozyme 400 hereof may be engineered to comprise a specific number of nucleotide linkers 450 within its flexizyme module 100.

Embodiments comprising seven (7), eight (8), and ten (10) nucleotide linkers 450 in circularly permuted dFx flexizyme module 100 were analyzed for catalytic activity in the presence of either cognate or noncognate tRNAs. Again, the tRNA^(Gly) _(GCC) A73U mutant is considered cognate because it can base pair with both the specifier nucleotides 426 of a T-box 200 and nucleotides at acceptor end 419 of flexizyme 100 (e.g., U46 of dFx).

An embodiment of ribozyme 400 with the flexizyme 100 comprising dFx and an eight (8) nucleotide linker 450 was only 2,6-fold less reactive than isolated dFx in the presence of the cognate tRNA. The observed rate constant for this embodiment of ribozyme 400 was about 0.21±0.015 h⁻¹, while—in comparison—isolated dFx exhibits a rate constant of 0.54±0.082 h⁻¹. Although the reactivity of the ribozyme 400 is slightly lower than that of the isolated dFx ribozyme 400, STAR-A8 was 3.8-fold less active in the presence of the noncognate tRNA^(Ile) _(GAU) (k_(obs)=0.055±0.0032 h⁻¹) as compared to when in the presence of the cognate tRNA^(Gly) _(GCC) U73A (see Table 1).

The eight (8) nucleotide linker 450 proved most advantageous—if not optimal—with respect to the ribozyme 400 gaining the most tRNA specificity as compared at least to the STARzyme with a ten (10) nucleotide linker 450, which displayed only a 1.9-fold difference in the observed rate constants for cognate and non-cognate tRNAs (k_(obs)=0.17±0.048 h⁻¹ for the cognate tRNA versus k_(obs)=0.090±0.029 h⁻¹ for tRNA^(Ile) _(GAU)). While the STARzyme with a seven (7) nucleotide linker 450 exhibited a similar difference in rate constants for cognate versus non-cognate tRNA (k_(obs)=0.092±0.015 h⁻¹ for the cognate tRNA versus k_(obs)=0.026±0.0061 h⁻¹ for tRNA^(Ile) _(GAU)), the catalytic activity of the ribozyme 400 was severely compromised (see Table 1). In sum, the difference in ribozyme 400 activity seen between the cognate tRNA^(Gly) _(GCC) U73A mutant and the non-cognate tRNA^(Ile) _(GAU) supports that the ribozyme 400 hereof uses the specifier nucleotides 426 in the T-box module 200 to recognize the anticodon loop 22 of the tRNA 12 and, as such, gains specificity.

In addition to studying the effect of increasing or decreasing the number of linkers 450 present within a flexizyme module 100, the discriminator base's 28 role as a specificity determinant was also assessed by characterizing the ribozyme's 400 reaction to wild-type tRNA^(Gly) _(GCC). Similar to the results observed for a dFx-flexizyme 100, the STARzyme-flexizyme 100 was 2- to 3-fold less active with the wild-type tRNA^(Gly) _(GCC) than with the tRNA^(Gly) _(GCC) U73A mutant (Table 1). Accordingly, the ribozyme 400 clearly achieves tRNA discrimination through recognition of both the anticodon 22 and discriminator base 28 of the tRNA 12. This is clearly seen in FIG. 9B where maximal activity requires base-pairing at the anticodon (compare squares with triangles) and base-pairing at the discriminator base (compare squares with diamonds).

Now referring to FIG. 8, the T-box and flexizyme modules 200, 100 of the fusion ribozyme 400 are linked together through the P1 stem of the flexizyme 100 (also see FIG. 4A in connection with dFx). It has been determined that changing the length of the P1 helix has a significant impact on the activity of the ribozyme 400 due to the geometrical constraints of the helix itself. Indeed, addition or removal of even a single base pair from P1 of the flexizyme module 100 as shown in FIG. 8 not only changes the length of the connector, but also the orientation of the flexizyme 100 active site 418 with respect to the T-box module 200 due to the twist of the helix. Accordingly, to ensure the interplay between the flexizyme module 100 and the T-box module 200 is optimized, the effects of mutations on non-cognate tRNA binding was assessed.

Wild-type tRNA^(Ile) _(GAU) can potentially make two base pairs (one Watson-Crick and one wobble) with the specifier nucleotides 426, GCC, from the T-box module 200. The first mutant is tRNA^(Ile) _(GCU), which contains a single A-to-C mutation at position 35. tRNA^(Ile) _(GCU) was predicted to form two G●U wobble pairs with the specifier 426 trinucleotide GGC. The second mutant is tRNA^(Ile) _(GCC), where the anticodon 22 sequence has been changed from GAU to GCC to restore the specifier-anticodon base pairing. Single-turnover kinetics were measured using these two mutants as substrates and k_(obs) values were compared to those of the wild-type tRNA^(Ile) _(GAU) and the cognate tRNA.

As shown in FIG. 9, subparts (A)-(E), as compared to the wild-type tRNA^(Ile) _(GAU), both mutants exhibited faster kinetics with about a 2-fold increase in k_(obs) (see also Table 2 below). However, the rate and aminoacylation yield was noticeably lower than that seen with the cognate tRNA (FIG. 9, subpart (C): tRNA^(Ile) _(GCU) (solid diamond), tRNA^(Ile) _(GCC) mutant (solid triangle), tRNA^(Gly) _(GCC) U73A (open square), tRNA^(Ile) _(GAU) mutant (open triangle)). Accordingly, the specificity of the flexizyme 100 correlates well with the number of Watson-Crick base pairs between the tRNA anticodon loop 22 (three nucleotides) and the discriminator nucleotide 28 at position 73.

FIG. 9, subpart (E) shows that the glyQS T-box 200 will efficiently aminoacylate a cognate tRNA 12, but not the non-cognate. However, as represented in the flow chart at the bottom of FIG. 9, subpart (E), when the specifier sequence 426 was mutated to be complementary to an amber suppressor tRNA and the anticodon 22 loop of the gly tRNA was mutated to comprise an amber suppressor sequence, there was no activity identified of the mutant STARzyme-tRNA pair. These results support that the STARzyme variant based on the glyQS T-box does not successfully introduce an unnatural amino acid into a protein when used with currently available in vitro translation kits. FIG. 9, subpart (F) shows the components of the reaction used to produce the results shown in FIG. 9, subpart (E).

Maximal activity was not observed with any of the tRNA^(Ile) variants, even those that were fully complementary to the STARzyme 100 at their anticodon 22 and discriminator 28 positions. The data suggests that, as observed in the crystal structure of the T-box module 200 bound to tRNA 12, there are additional interactions between the elbow regions of tRNA^(Gly) and the T-box module 200 of the flexizyme 100 that are not present when tRNA^(Ile) is bound. This is also consistent with the lack of binding observed between the flexizyme 100 and the tRNA 12, which is an important features in the design of an effective orthogonal system.

TABLE 2 Kinetic parameters for STARzyme with GGC or GAC specifier sequence in the presence of a different tRNA constructs as indicated. Specifier Anti- sequence tRNA construct codon k_(obs) (h⁻¹) f_(max) GGC tRNA^(Gly) _(GCC) U73A match  0.16 ± 0.0066  0.55 ± 0.0061 GGC tRNA^(Ile) _(GAU) mis-  0.047 ± 0.0094  0.40 ± 0.044 match GGC tRNA^(Ile) _(GCU) match  0.11 ± 0.041  0.47 ± 0.047 GGC tRNA^(Ile) _(GCC) match 0.092 ± 0.025  0.38 ± 0.090 GAC tRNA^(Gly) _(GCC) U73A mis- 0.088 ± 0.022  0.43 ± 0.038 match GAC tRNA^(Gly) _(GUC) U73A match  0.14 ± 0.025  0.56 ± 0.022 Parameters reported here represents average values from three independent measurements. Errors stand for standard deviations. Programming Specificity into the Flexizyme

Finally, the ability to change the anti-codon binding specifier region 426 of the T-box module 200 in affect a change in its specificity for a tRNA substrate 12 was assessed. A tRNA mutant was designed based on the body and structure of tRNA^(Gly) (see FIG. 6). Perhaps more specifically, an anticodon 22 triplet of tRNA^(Gly) _(GCC) A73U was mutated to GUC, which corresponds with an aspartate codon to generate the tRNA^(Gly) _(GUC) mutant. Accordingly, the specifier trinucleotide 426 of tRNA^(Gly) _(GCC) A73U was mutated from GGC to GAC (STARzyme G88A), which was complementary to the anticodon 22 triplet of the tRNA^(Gly) _(GUC) A73U mutant.

Kinetic assays were performed to test if the ribozyme 400—and, in particular, the STARzyme G88A specifier mutant—had switched the specificity of the ribozyme 400 to favor tRNA^(Gly) _(GUC) U73A over the cognate substrate. In the presence of the original tRNA^(Gly) _(GUC) U73A construct, the STARzyme G88A mutant 400 reacted with a k_(obs) value of 0.088±0.02 and results in a yield of about ˜43%. (see FIG. 11, subpart (D), and Table 2). Both parameters were significantly lower than those obtained from the wild-type (WT) STARzyme 400, suggesting that it is necessary to have perfect complementarity between the anticodon 22 and the specifier 426 to achieve full aminoacylation activity. In the presence of the tRNA^(Gly) _(GUC) U73A mutant, however, the STARzyme G88A mutant 400 reacts with a k_(obs) value of 0.14±0.025 and f_(max) is about 56%. Both parameters are the same, within errors, as those obtained from the WT ribozyme 400 in the presence of the original tRNA^(Gly) _(GCC) U73A construct. Accordingly, the data suggest that it is possible to change the specificity of the ribozyme 400 for its tRNA substrate 12 in a predictable manner.

Materials and Methods

The various materials and methods used for preparing the variants and performing the studies described herein are as follows and are additionally well known in the art:

RNA Sample Preparation. DNA oligos for PCR reactions were ordered from IDT. DNA templates for ribozymes 300, 400 and tRNAs were inserted into pUC-19 between the XbaI and HindIII restriction sites. RNA samples were made by in vitro transcriptions, and purified by urea denaturing gel electrophoresis as previously described.

In vitro gel shift assays. tRNA, the antiterminator-deleted glyQS T-box RNA, and STARzyme samples were diluted into 10 μM stocks in 50 mM HEPES-KOH (pH 7.5) and 100 mM KCl. To refold, RNA samples were heated separately at 95° C. for about 2 min, and slowly cooled to room temperature over about 10 min. Thereafter, 10 mM of MgCl₂ was added to the sample followed by an additional 5-min incubation at room temperature. 0, 1, 3 or 6 μM of the refolded T-box RNA or STARzyme were mixed with 1 μM of refolded tRNA samples in the presence of 50 mM HEPES-KOH pH 7.5, 100 mM KCl and 10 mM MgCl₂. The 20-μL binding reactions were incubated at room temperature for 30-60 min followed by the addition of 5 μL 50% glycerol. 5 μL of the samples (out of 25 μL) were then loaded onto a 6% native polyacrylamide gel (0.5× Tris/borate/EDTA buffer, 5 mM MgCl₂). The gel was run at 4° C. at 10 W for 1.5 hour. The gel was stained with Stains-all (Sigma) overnight at 4° C. Gel images were obtained using Bio-Rad ChemiDoc XRS+.

Single-turnover kinetics. 10 μM of ribozyme and 2 μM of tRNA sample were heated separately at 95° C. for 2 min in a buffer containing 50 mM pH 7.5 HEPES-KOH and 100 mM KCl. Samples were slowly cooled to room temperature over 10 min. 20 mM MgCl₂ was then added to each sample, followed by ˜10 min incubation at room temperature. The ribozyme 300, 400 was then mixed with tRNA and the reaction solution was incubated at room temperature for about 1 h. Reaction solution was then incubated at 4° C. for 10 min.

To start the reaction, 5 mM DBE-activated alkynyl amino acid substrate was added. At each time point, 2 μL of the reaction aliquot was removed and quenched with 8 μL of acid loading dye that contains 100 mM sodium acetate pH 5.2, 7 M urea, 0.05% bromophenol blue and 10 mM EDTA. Reaction products were resolved by 8% acidic denaturing (7 M urea) gel buffered with 100 mM sodium acetate pH 5.2. The gel was stained with SYBR Green II (Life technologies). Fluorescent signal was detected using Typhoon FLA 9500 (GE Healthcare), and the intensity of fluorescence over time was measured using ImageQuant TL (GE Healthcare). Rate constants were obtained by fitting the fraction of aminoacylated tRNA versus time using the single-exponential equation (Synergy KaleidaGraph V4.1):

f _(t) =f ₀+(f _(max)-f ₀)(1-e ^(−k) ^(obs) ^(t)),

where f_(t) is the fraction of tRNA charged at time t, f₀ is the fraction of tRNA charged at time zero that corrects for the background noise due to insufficient separation between the charged and uncharged tRNA (see FIG. 11 for detail), f_(max) is the fraction of tRNA charged at infinite time and was used in this paper to represent maximal aminoacylation yield, k_(obs) is the first-order rate constant.

Labelling aminoacylated tRNA with a fluorophore. Two 20-pt aminoacylation reactions containing 10 μM of ribozyme and 2 μM of tRNA sample were prepared and initiated as described above for the activity assays. The reactions were incubated at 4° C. for about 6 hours (dFx) or about 24 hours (STARzyme) to achieve maximal aminoacylation yield of the tRNA. After the incubation, the RNA was precipitated by the addition of 40 μL of 0.3 M pH 5.2 sodium acetate and 100-μL of room temperature ethanol. RNA samples were spun down by centrifugation at 4° C. for 25 min at 13,200 rpm. The pellet was washed twice with 20-μL of 70% ethanol containing 0.1 M NaCl, followed by brief centrifugation. Copper-catalyzed click reactions were performed using a Click-iT Plus Alexa Fluor 488 Picolyl Azide Toolkit (ThermoFisher Scientific).

Briefly, a 500-μL reaction cocktail containing 5μM of Alexa Fluor 488 picolyl azide and 10 μL of the provided copper protectant was prepared following the manufacture's protocol. No CuSO₄ was added to the cocktail to minimize RNA degradation. The precipitated RNA pellet was then dissolved in 20 μL of the reaction cocktail. The click reaction was incubated in dark for 30 min. In order to remove the “unclicked” azide, the 20-μL click reaction was diluted into 500-μL by adding 480-μL of 100 mM sodium acetate at pH 5.2. The 500-μL solution was then concentrated to ˜20-μL by using an Amicon Ultra centrifugal filter unit with a 10K NMWL cutoff. The process was then repeated once to further remove any residual azide. The click reaction sample was then mixed with equal volume of colorless acid loading buffer that contains 100 mM sodium acetate pH 5.2, 7 M urea, 0.05% and 10 mM EDTA. Gel electrophoresis was performed as described in the above section. To analyze the click reaction product, unstained gels were scanned using Typhoon FLA 9500 (GE Healthcare).

Production of the Ribozyme

Methods for engineering, producing, and using the inventive fusion ribozymes described herein are also a feature of the present disclosure. Now referring to FIG. 10, a flow-chart illustrative of a method 1100 for engineering and/or producing a ribozyme 400 is shown. At step 1102, a starting molecule is created by fusing a flexizyme module 100 (to facilitate acceptance of a wide variety of amino acid substrates) with a tRNA-specific T-box module 200 (to create specificity for the tRNA). In at least one exemplary embodiment of the method 1100, the flexizyme module 100 comprises dFx and the T-box module 200 comprises a Gkau glyQS T-box. Nevertheless, despite the specific examples provided herein, one of skill in the art will appreciate any number of different flexizymes and/or bacterial T-box riboswitches may be used depending on the desired application and the inventive concepts with respect to the systems and methods hereof applies equally.

Depending on the desired structure of the ribozyme 400, in at least one exemplary embodiment of the method 1100, step 1102 may comprise either sub-step 1102 a or 1102 b. Perhaps more specifically, where a linear fusion ribozyme 400 is desired, at sub-step 1102 a, the 3′-terminus of the T-box module 200 is connected to the 5′-terminus of a flexizyme module 100 via a 5 to 10 nucleotide poly-A linker 450 with the secondary structure presenting within both modules 100, 200 following the fusion at sub-step 1102 a. This creates a linear permutation of the ribozyme 400.

Alternatively, if a circularly permuted topology of the flexizyme is desired, step 1102 may comprise linking the 3′-terminus of the T-box module 200 to the 5′-terminus of a flexizyme module 100 and opening the P1 loop of the flexizyme 100 at sub-step 1102 b. To maximize functionality, sub-step 1102 b may optionally comprise the incorporation of one or more adenosines to facilitate the linkage between the 5′-terminus of the parental flexizyme 100 and the 3′-terminus of the T-box RNA 200 that comprises the CCA-binding site 419 of the flexizyme. In this embodiment, the 3′-terminus of the T-box module 200 may then be connected to the new 5′-terminus of the circularly permuted flexizyme 100.

At step 1104, cognate tRNA 12 is bound to the T-box module 200 and, at step 1106, tRNA 12 is bound to the flexizyme module 100. For example, in at least one exemplary embodiment, at step 1104 the anticodon 22 of the tRNA 12 binds the specifier loop 426 of the leader portion 24 of the T-box RNA 200 and, at step 1106, the acceptor end 27 (3′-terminus) of the bound tRNA 12 is superposed over the acceptor stem analog 417 present in the crystal structure of the flexizyme 100. As the T-box 200 exhibits tRNA specificity, the ribozyme 400 will only allow tRNA 12 it specifically recognizes (cognate) to bind. Further, at step 1108, an amino acid (e.g., a uAA) is bound to the active site 418 of the flexizyme 100.

It will be appreciated that steps 1104, 1106, and/or 1108 may be performed concurrently or in any order or sequence. Furthermore, at least steps 1104 and 1108 may be performed before or after step 1102, as desired. It will be appreciated that the tRNA 12 and/or an amino acid 14 may be bound with the ribozyme 400 (or any component thereof) at any point within the method 1100 or thereafter.

In at least one embodiment, method 1100 may further comprise optimization step 1110. As previously described, the interplay between the flexizyme module 100 and the T-box module 200 of the ribozyme 400 may be engineered to achieve optimal catalytic activity and/or improved specificity (e.g., through in vitro evolution by randomly introducing mutations into the ribozyme 400 and selecting for those that have the highest activity and strongest selectivity for cognate tRNA). This may be achieved in a number of ways, for example, by changing the length of the P1 stem of the flexizyme 100 and/or the size of the linker 450. Accordingly, at step 1110, compensatory mutations may be introduced to the CCA tail 16 of the tRNA 12 and/or internal guide, base pairs may be added to or removed from P1 stem (e.g., removing one A●U base pair therefrom), and/or the number of nucleotides present within the linker 450 of a circularly permuted flexizyme 100 molecule may be optimized (e.g., the linker 450 may be engineered to comprise eight (8) nucleotides).

Additionally or alternatively, the ribozyme 400 may be further manipulated to alter the tRNA substrate 12 specificity thereof at step 1112. For example, the sequence of the anti-codon binding specifier loop 426 of the ribozyme 400 may be modified to change its specificity for the anticodon loop of the tRNA substrate 12 using standard methods to introduce site-directed mutation (e.g., to specifically aminoacylate an amber (UAG) suppressor tRNA 1252 as described in further detail below). It will be appreciated that the specifier loop 426 may be modified as desired provided that it is complementary to the anticodon triplet 22 of the desired cognate tRNA 12. Accordingly, step 1112 allows for the specificity of the ribozyme 400 for its desired cognate tRNA substrate 12 to be modified in a predictable and effective manner. Notwithstanding the foregoing, it is notable that the substrate specificity of the ribozyme 400 with respect to an amino acid substrate 14 need not be manipulated as the flexizyme module 100 does not discriminate between amino acids 14 and can thus charge a bound tRNA 12 with any desired amino acid 14—even if that amino acid 13 is an uAA.

Now referring to FIG. 11, a method 1200 of incorporating a uAA into a growing polypeptide chain using the ribozyme 400 hereof is also provided. Such method 1200 may be performed in vivo in a cell (e.g., a non-eukaryotic cell, such as an E. coli cell or the like, or a eukaryotic cell) and/or in in vitro applications, as desired. For the avoidance of doubt, as used herein, the term “cell” shall refer to and mean either an in vivo or in vitro application unless expressly indicated otherwise. Method 1200 enables the synthesis of proteins that comprise uAAs in large and useful quantities, which is exceedingly beneficial due to the number of important applications for proteins that include uAA(s).

Generally, the incorporation of a uAA into a protein can be done to tailor changes in the protein structure and/or function, such as, for example, to change its size, acidity, nucleophilicity, hydrogen bonding, hydrophobicity, accessibility of protease target sites, target to a moiety (e.g., for a protein array), etc. Furthermore, proteins that include a uAA can have enhanced or even entirely new catalytic or physical properties (e.g., toxicity, structural properties, biodistribution, chemical and/or photochemical properties, spectroscopic properties, catalytic ability, half-life, ability to react with other molecules (e.g., covalently or noncovalently), and the like). As such, the compositions including such proteins are useful in a vast number of contexts including, for example, for novel therapeutics, diagnostics, industrial enzymes, catalytic enzymes, binding proteins (e.g., antibodies), as well as in the study of protein structure and function. Additionally, one or more uAAs may be incorporated into a polypeptide to provide a molecular tag.

As previously described, the ribozyme 400 comprises a single molecule that includes a discriminating binder element (i.e. the T-box module 200) capable of binding a cognate tRNA 12 with specificity, an indiscriminate amino acid 14 binder element (i.e. the active site 418 of the flexizyme module 100), and an ARS-like catalyzing element for catalyzing the addition of an amino acid 14 to a tRNA 12 substrate (i.e. the flexizyme module 100). The ribozyme 400 can be used in connection with a translation system (either in vivo or in vitro) to incorporate a uAA 14 into a protein.

At step 1202 of the method 1200, the ribozyme 400 and the desired substrates 12, 14 are delivered to a cell. Because the ribozyme 400 is an RNA molecule, the ribozyme 400 can be delivered to the cell on a single plasmid along with its suppressor tRNA and amino acid substrates 12, 14. Furthermore, if introduced into a living cell, use of the ribozyme 400 bypasses the need to translate the catalyst component into a protein. As is known in the art, the specific cognate tRNA 12 chosen will coincide with the specialized codon within the mRNA (e.g., an amber STOP codon) positioned at the location where the uAA 14 is to be incorporated within the polypeptide chain. Additionally, it will be appreciated that the ribozyme 400 may be used in place of—or in conjunction with conventional orthogonal systems.

At step 1204, the ribozyme 400 binds the suppressor tRNA 12 and, at step 1206, charges the bound tRNA 12 with the uAA 14. Namely, at step 1204, the specifier loop 426 of the T-box module 200 binds the anticodon 22 of the tRNA 12 (with specificity) and the acceptor end 419 of the flexizyme module 100 binds the CAA tail 16 of the bound tRNA 12. Due to the specificity provided by the T-box module 200, the ribozyme 400 specifically recognizes and aminoacylates its targeted suppressor tRNA 12 without the risk of crosstalk with any endogenous tRNA (or synthetases) present (within a living cell or otherwise).

Further, at step 1206, the active site 418 of the flexizyme module 100 binds the uAA 14. Note that the active site 418 of the flexizyme module 100 exhibits high substrate promiscuity with respect to amino acid substrates 14 and will thus will have a high degree of effectiveness in binding (and using in connection with aminoacylation) most amino acid substrates 14 added to the cell.

The ribozyme 400 catalyzes aminoacylation of the bound tRNA substrate 12 at step 1208. Perhaps more specifically, the flexizyme module 100 of the ribozyme 400 adds the bound uAA 14 to the bound tRNA substrate 12, thereby charging the tRNA 12 and releasing the same from the ribozyme 400. At step 1210, the charged tRNA 12 is transferred to the ribosome and used in ribosomal translation.

If, during the method 1200, a change in amino acid 14 is desired, the method 1200 may advance to step 1212, where the new amino acid 14 is simply added to the cell. Indeed, because the flexizyme module 100 (i.e. catalytic component) of the ribozyme 400 can accommodate a wide variety of amino acid substrates 14 at its active site 418, in most cases, it need not be reengineered if a different amino acid 14 is to be used. Indeed, the new substrate 14 can simply be added the method 1200 will repeat, using the new amino acid 14.

Methods for using the Ribozyme for Specific Incorporation of Amino Acids

As previously stated, the inventive systems and methods of the present disclosure are capable of selective, codon-specific recognition of the targeted tRNA substrate 12 and charging that tRNA substrate 12 with a desired amino acid 14. In at least one embodiment, the concepts hereof may be applied to fuse a flexizyme module 100 with a T-box module 200 that has been specifically engineered to accept a particular tRNA substrate 12 (using, for example, step 1112 of method 1100). The resulting ribozyme 400 may be used in prokaryotic and/or eukaryotic expression systems to easily and cost-effectively incorporate the desired tRNA substrate 12 into a protein.

In at least one non-limiting example, the T-box module 200 of ribozyme 400 is constructed in vitro to specifically aminoacylate an amber suppressor tRNA 1252 that recognizes tRNA^(Tyr). While, as previously described the gly T-box module was not wholly successful in incorporating a specific uAA into a protein when used with convention in vitro translation kits, use of a tyrS T-box module did result in the successful incorporation of a specific amino acid into a protein using the various methods described herein.

As shown in FIG. 12, while a wild-type tyrS T-box module 200 comprises a UAC codon in the specifier loop 426 (see subpart A of FIG. 12), a mutant tyrS T-box module 1250 can be constructed at the specifier loop 426 that is complimentary to the amber suppressor tRNA 1252 anticodon 422 triplet CUA (see subpart B of FIG. 12) (as used herein, such embodiments of ribozyme 400 is referred to as the “AMB-STARzyme 1280”). In at least one exemplary embodiment in accordance with the foregoing description, the ribozyme 400 comprises SEQ ID NO: 3 or a functional equivalent thereof. The T-box mutations can be achieved using methods well-known in the art such as, without limitation, those described by F. J. Grundy in 2000 and/or complete synthesis of the gene.

Here, the mutation at the specifier loop 426 comprises UAG; however, other mutants can be constructed and used as desired. For example, there are at least four identity elements that determine the orthogonality of tRNA^(Tyr)—the G1C72 base pair, the discriminator base A73 (discriminator base 28), the variable loop, and the anticodon sequence (anticodon 422). If tRNA^(Tyr) is the tRNA substrate 12/1252 of interest, the T-box module 200 is thus mutated at one or more of its corresponding locations to compliment the tRNA^(Tyr). Accordingly, in at least one embodiment, tRNA^(AMB) _(CUA) deletes the UU in the variable loop and instead inserts AA therein. By way of explanatory example, Table 3 identifies the effects of mutations in the identity elements of Eco-tRNA^(TYR) on charging by Eco-TyrRS^(a).

Now referring to FIG. 13A, a schematic of a translation reaction using an amber suppressor tRNA 1252 and the STARzyme to insert an unnatural amino acid into a protein is shown. To facilitate understanding FIG. 13A will now be used for reference in describing at least one embodiment of a method 1300 for applying the AMB-STARzyme 1280/amber suppressor tRNA 1252 conjugate pair in a bacterial expression system pursuant to the present disclosure. Notably, at least steps 1310 and 1312 of the method 1300 may be performed in vitro or in vivo. Certain steps of method 1300 overlap with one or more steps of previously described methods 1100, 1200 and, where that is the case, the relevant step(s) of methods 1100, 1200 are indicated in parentheses for context. While this method 1300 is described in conjunction with the AMB-STARzyme 1280 and an amber suppressor tRNA 1252, one of ordinary skill in the art will appreciate that method 1300 may be used with any ribozyme of the present disclosure (e.g., without limitation, 400 and/or 1280), cognate tRNA 12, and amino acid 14 (natural or unnatural) to generate a desired protein in large-scale quantities.

At step 1302, an amber suppressor tRNA 1252 comprising the anticodon 422 CUA is provided to a translation system and combined with the AMB-STARzyme 1280 (which, as shown in FIG. 13A, may comprise a mutant T-box module 1250 having an UAG triplet at the specifier loop 426 and, in at least one embodiment, was prepared using at least steps 1104 and 1106 of method 1100). At step 1304 (which may be performed simultaneously with step 1302), an activated amino acid substrate 14 (e.g., acyl-DBE) is added to the AMB-STARzyme 1280 to initiate the reaction (step 1108) to aminoacylate the tRNA 1252 such that the DBE group from the activated amino acid substrate 14 is released. The combination of the AMB-STARzyme 1280 with the activated amino acid substrate 14 at step 1304 may occur after, prior to, or concurrent with combining the AMB-STARzyme 1280 and the amber suppressor tRNA 1252 at step 1302. At step 1306 the amber suppressor tRNA 1252 is charged with the amino acid 14 to produce a charged suppressor tRNA 1340. Steps 1302-1304 may be repeated as necessary to produce a desired amount of charged suppressor tRNA 1340, with each AMB-STARzyme 1280 being reusable to bind a new activated amino acid substrate 14 and aminoacylate the next tRNA 1252.

At step 1308 (akin to step 1202 of method 1200), the charged suppressor tRNA 1340 is delivered to a ribosome 400 of a prokaryote (or, in other embodiments, a cell of a eukaryote). As indicated above, either in vivo or in vitro methodologies may be used to perform step 1308.

When the charged suppressor tRNA substrates 1340 are brought to a site on the ribosome 400 containing the next codon to be translated (the A-site) at step 1308, they may also be exposed to elongation factor or other reagents for conducting in vitro translation (e.g., present within a commercially obtained in vitro translation kit). In prokaryotic cells, prokaryotic elongation factor thermo unstable (EF-Tu) catalyzes translation of the charged suppressor tRNA substrates 1340 by the ribozyme 400, while in eukaryotic cells the elongation factor comprises eEF-1 and/or eEF-2 (but has the a like function). It will be appreciated that, as used herein, “elongation factor” refers to the appropriate type of elongation factor relevant to the underlying cells, whether prokaryotic or eukaryotic and may be a component of an in vitro translation kit.

The elongation factor facilitates the selection and binding of the charged suppressor tRNA 1340 to the A-site of the ribosome 400. At step 1310, the charged suppressor tRNA 1340 passes through and is read by the ribosome 400, which translates the same into a peptide fragment 1352 corresponding with the sequence from the mRNA 1340 until it gets signals to stop. The resulting peptide 1354, comprising the amino acid 14 incorporated therein at a specific site, is released at step 1312 and folds into its dictated 3D geometry.

It will be appreciated that, similar to other components of the inventive systems described herein, the amber suppressor tRNA 1252 is not orthogonal. Instead, the amber suppressor tRNA 1252 can be charged with any amino acid 14 of interest (natural or artificial) through use of the novel ribozymes 400, 1280 and methods 1100, 1200, 1300 hereof.

Furthermore, a commercial in vitro protein synthesis kit 1370 is employed to facilitate a cell-free expression system (for example, and without limitation, a PURExpress™ protein synthesis kit). As shown in FIG. 13B, the amber suppressor tRNA (template), the AMB-STARzyme 1280, ribozymes 400, an in vitro translation kit 1382, and the amino acids 14 are all combined in a tube 1375 (steps 1302-1310). In at least one embodiment, the tRNA template may comprise a plasmid encoding the DHFR protein and containing a stop codon as shown in FIG. 15A, for example. Additional compounds may also be added as desired and depending on the target, such as his-tagged factors, small molecule fluorophores, and/or metal affinity resin (not shown). For example, if the goal is to localize particular proteins within cells, small molecule fluorophores may be added. Such systems 1370 are simple to optimize for a particular application target and may be utilized for one, some, or all of the steps of method 1300.

In addition the foregoing, the use of expression systems 1370 provide numerous other benefits. Expression systems 1370 often do not require proteases or nucleases, which simplifies the overall process. Furthermore, when in vitro expression kits are used, release factor 1 (RF1)—which typically binds to an amber stop codon and suppresses translation—can be removed. The unique nature of the ribozymes 400, 1280 hereof and related method 1300 are particularly suited for use with such expression systems 1370 thus adding to the overall commercial and research value thereof.

This technology can be used to generate simple kits and reagents to study a broad range of biological processes. Indeed, the practical applications of the inventive ribozymes 400, 1280 of the present disclosure are vast and include, for example, measuring the impact of post-translational modifications on histones and other proteins, localization of proteins in cells with tags and labels, the identification of dynamic interaction partners with cross-linkers, and more. Because the ribozymes 400, 1280 may be readily synthesized by in vitro transcription, it can be made in large quantities for use in bioreactors and the like, even where mutated/specific T-box modules 200 are desired. Accordingly, the inventive ribozymes 400, 1280 of the present disclosure can be used to generate large-scale quantities of proteins for use as therapeutics, structural and biophysical studies, or any other application that requires a large amount of protein.

Related to the foregoing, at least one exemplary embodiment, the method 1300 may further comprise optional step 1314. At step 1314, the resulting peptide 1354 is analyzed to identify and/or analyze a target present therein using an appropriate methodology (such as, for example, gel electrophoresis, SDS-PAGE, Western Blot, size exclusion chromatography, affinity chromatography, isotope coded protein labelling (ICPL), matrix-assisted laser desorption/ionization (MALDI) time of flight (TOF) mass spectrometry, and the like, and/or any combination thereof). For example, where the amino acid 14 incorporated therein comprises a tag, the analysis performed at step 1314 may comprise a fast mix-and-measure immunoassay or the like. Additionally or alternatively, the analysis may comprise, SDS-PAGE separation, followed by cutting the product bands, in-gel trypsin digestion, and MALDI-TOF mass spectrometry to identify if uAAs 14 sufficiently incorporated at the desired amber stop codon. It will be appreciated that step 1314 may include any analysis method of biomolecules that may be appropriate (or combination thereof) desired to analyze an intended target or characteristic of the peptide 1354 and the examples provided herein are in no way limiting.

Examples and Results

To verify the inventive concepts presented herein, method 1300 was performed using a AMB-STARzyme 1280 designed to specifically aminoacylate an amber suppressor tRNA 14 in vitro. Perhaps more specifically, the AMB-STARzyme 1280 was prepared comprising a circularly permuted flexizyme module 100 comprising SEQ ID NO: 2 (or a functional equivalent thereof) and a T-box module 200 comprising a Bacillus subtilis tyrS T-box mutant tyrS T-box module 1250 comprising SEQ ID NO: 3. Here, the B. subtilis AMB-STARzyme 1280 variant comprising SEQ ID NO: 3 was specifically constructed to aminoacylate amber suppressor tRNA^(tyr) 1252.

To verify that the AMB-STARzyme 1280 exhibits the desired specificity for the amber suppressor tRNAs 1252 (cognate) and noncognate tRNAs, six 20-μL aminoacylation reactions (4 wells each), each containing 10 μM of a AMB-STARzyme 1280 variant and 2 μM of tRNA sample (cognates or noncognates), were prepared and initiated in vitro as described above for the activity assays using an expression system 1370. The reactions were incubated at 37° C. and aliquots were taken at the desired time points. At each time point, 2 μL of the reaction aliquot was removed and quenched with 8 μL of acid loading dye containing 100 mM sodium acetate pH 5.2, 7 M urea, 0.05% bromophenol blue and 10 mM EDTA. Reaction products were resolved by 8% denaturing (7 M urea) gel buffered with 100 mM sodium acetate pH 5.2. The gel was stained with SYBR Green II (Life technologies).

FIG. 14 shows the results of an in vitro aminoacylation assay that was performed on the resulting products to identify the concentration of charged tRNA and uncharged tRNA (not aminoacylated) present in each well. The results indicate that both variants of the AMB-STARzyme 1280 successfully charged tRNA^(Tyr) _(CUA) ^(AMB) with specificity, but were unreactive with the noncognates (tRNA^(Tyr) _(GUA) ^(WT), tRNA^(Gly) _(GCC) ^(U73A), tRNA^(Ile) _(GAU) ^(WT)). Furthermore, the longer the incubation time, the higher the concentration of aminoacylated cognates present within the wells.

Now referring to FIGS. 15A and 15B, the inventive systems and methods of the present disclosure were used to produce a full-length dihydrofolate reductase (DHFR) (bacterial species) with an artificial amino acid at position 115 (labeled as “TAG” in FIG. 15A to identify the position in the gene where the stop codon was introduced).

Perhaps more specifically, a version of method 1300 was employed utilizing an amber stop codon template and AMB-STARzyme 1280 of at least one embodiment of the present disclosure. The PURExpress ΔRF123 kit (NEB) was used and an in vitro translation reaction with a total volume of 30 μl was performed for each sample. The components were added to a tube in the following order: 10 μl of solution A, 7.5 μl of solution B, 0.5 μl of RF2, 0.5 μl of RF3, 0.5 μl of RNase inhibitor (NEB), 0.5 μl of 360 mM magnesium acetate, 0.5 μl of 120 mM ATP, amber suppressor tRNA (a final concentration of 27 μM), AMB-STARzyme (a final concentration of 66 μM), sufficient RNase-free water to make a total volume of 29.5 μl and lastly, 0.5 μl of 600 ng/μl DNA template encoding DHFR containing a STOP codon at the indicated site in the gene. The reactions were incubated for 3 hours at 37° C.

FIG. 15B shows the result of the in vitro translation assay, comparing the WT DHFR template (column (a)), no template (column (b)), the WT DHFR template in the presence of AMB-STAR and an amber suppressor tRNA (column (c)), and the mutant template in the presence of a variation of a ribozyme of the present disclosure, the artificial amino acid, and an amber suppressor tRNA (column (d)). The results clearly illustrate that an in vitro translation system can function in the presence of the AMB-STARzyme 1280 of the present disclosure. Indeed, out of all of the subjects tested, the AMB-STARzyme 1280 produced the highest concentration of full-length DHFR, further evidencing the effectiveness of the inventive ribozyme in aminoacylation with specificity.

Thereafter, the resulting peptide was further analyzed to verify the incorporation of the amino acid 14 of interest. The following three groups were each analyzed independently using SDS-PAGE separation, followed by cutting the product bands, performing in-gel trypsin digestion, and MALDI-TOF mass spectrometry: (1) wild-type template only; (2) tRNA substrate 12 (amber stop codon template) only; and (3) tRNA substrate 12 (amber stop codon template), STARzyme 1280, and DBE-alkynyl (uAA 14). The results were then compared to determine if the DBE-alkynyl (uAA 14) incorporated at the desired amber stop codon, or if a natural amino acid misincorporated at the amber stop codon.

FIGS. 16A-16C illustrate that DHFR protein can be synthesized in the presence of the artificial amino acid, one variation of a ribozyme of the present disclosure, and a suppressor tRNA. Negative controls missing at least one component (see highlighted boxes) indicate there is some misincorporation of natural amino acids at position 115. As shown in FIG. 16C, in the presence of release factor 1, full length DHFR protein cannot be synthesized (labeled FL); however, removing release factor 1 allows readthrough of the stop codon and full length DHFR to be synthesized (see arrow). Trypsin digest of DHFR containing the artificial amino acid is predicted to create a peptide fragment with a m/z=5818.5138 value.

FIG. 17A lists the predicted trypsin fragments generated when using a stop codon template constructed by mutating the position corresponding to proline 105 to a TAG stop codon. FIG. 17B illustrates the change in charge to mass ratio when the fragment containing position 105 contains the wild type, unnatural, or alternate misincorporated amino acids at position 105. FIG. 17C supports that the mass of the fragment containing position 105 is shifted to reflect incorporation of the unnatural amino acid in the presence of the P105AMB substrate and the STARzyme, thus validating the findings hereof.

FIGS. 18A-18C illustrate that the peptide fragment containing position 54 (labeled Fragment X in FIG. 17A) shows a high signal. FIG. 18A illustrates a DNA template encoding the protein DHFR with a mutation at the position corresponding to L54 to generate a stop codon at that position (L54AMB). FIG. 18B illustrates the predicted charge to mass ratios of the peptide fragment containing amino acid 54 when the wild type, unnatural amino acid, or alternate amino acids are incorporated at position 54. FIG. 18C illustrates the predicted results when WT template, L54AMB template, or L54AMB plus a flexizyme of the present disclosure are used in the inventive in vitro translation systems hereof. FIGS. 18D-F support that peptides containing the artificial amino acid are only observed when both the L54AMB template and a flexizyme of the present disclosure are both present in the reaction.

Accordingly, unlike conventional techniques, the compositions and methods of the present disclosure provide a streamlined and easy-to-use process for incorporating uAAs into protein biosynthesis. While various embodiments of compositions, systems, and methods hereof have been described in considerable detail, the embodiments are merely offered by way of non-limiting examples. Many variations and modifications of the embodiments described herein will be apparent to one of ordinary skill in the art in light of the disclosure. It will therefore be understood by those skilled in the art that various changes and modifications may be made, and equivalents may be substituted for elements thereof, without departing from the scope of the disclosure. Indeed, this disclosure is not intended to be exhaustive or to limiting. The scope of the disclosure is to be defined by the appended claims, and by their equivalents.

Further, in describing representative embodiments, the disclosure may have presented a method and/or process as a particular sequence of steps. However, to the extent that the method or process does not rely on the particular order of steps set forth herein, the method or process should not be limited to the particular sequence of steps described. As one of ordinary skill in the art would appreciate, other sequences of steps may be possible. Therefore, the particular order of the steps disclosed herein should not be construed as limitations on the claims. In addition, the claims directed to a method and/or process should not be limited to the performance of their steps in the order written, and one skilled in the art can readily appreciate that the sequences may be varied and still remain within the spirit and scope of the present disclosure.

It is therefore intended that this description and the appended claims will encompass, all modifications and changes apparent to those of ordinary skill in the art based on this disclosure. 

1. An artificial ribozyme comprising: a T-box element comprising a 3′-terminus and a specifier loop for binding a cognate tRNA substrate with specificity; and a flexizyme comprising dinitro-flexizyme, Fx3, or a variant of Fx3 and defines at least an active site for binding an amino acid, an acceptor end for base pairing to a terminus of a tRNA substrate, a nucleotide linker, and a P1 stem linked to the 3′-terminus of the T-box element; wherein when the T-box element recognizes and preferentially binds the cognate tRNA substrate, the flexizyme can bind and aminoacylate the cognate tRNA substrate bound by the T-box element, and the active site of the flexizyme is not specific to one or more targeted amino acids.
 2. The artificial ribozyme of claim 1, wherein the T-box element further comprises a mutation that is complimentary to an anticodon of the cognate tRNA substrate.
 3. The artificial ribozyme of claim 2, wherein the mutation is at the specifier loop of the T-box element and the cognate tRNA substrate comprises an amber suppressor tRNA.
 4. The artificial ribozyme of claim 2, wherein the flexizyme further comprises SEQ ID NO: 2, SEQ ID NO: 3, or a functional equivalent of either SEQ ID NO: 2 or SEQ ID NO:
 3. 5. The artificial ribozyme of claim 3, wherein the mutation at the specifier loop comprises a stop codon comprising an uracil-adenine-guanine nucleotide sequence.
 6. The artificial ribozyme of claim 1, wherein the one or more targeted amino acids comprises a natural amino acid, an unnatural amino acid, or both.
 7. The artificial ribozyme of claim 2, wherein the mutation is at a location in the T-box element that corresponds with an identity element for determining a degree of orthogonality of the cognate tRNA substrate.
 8. A synthesis kit for producing peptides that incorporate one or more amino acids at one or more specific sites, the kit comprising: a first cognate tRNA substrate; a first artificial ribozyme comprising the artificial ribozyme of claim 1, wherein the T-box element of the first artificial ribozyme recognizes and preferentially binds the first cognate tRNA substrate; a first targeted amino acid; and one or more reagents for conducting in vitro translation; wherein at least one of a first group consisting of the first artificial ribozyme, the first cognate tRNA substrate, and the first targeted amino acid are housed in a first container separate from the other members of the first group.
 9. The synthesis kit of claim 8, wherein the T-box element of the first artificial ribozyme further comprises a mutation from wild-type and the mutation is complimentary to an anticodon of the first cognate tRNA substrate.
 10. The synthesis kit of claim 8, wherein the T-box element of the first artificial ribozyme further comprises a mutation from wild-type at the specifier loop, the first cognate tRNA substrate comprises an amber suppressor tRNA, and the mutation of the T-box element is complimentary to an anticodon of the amber suppressor tRNA.
 11. The synthesis kit of claim 8, wherein the flexizyme of the first artificial ribozyme comprises SEQ ID NO: 2, SEQ ID NO: 3, or a functional equivalent of either SEQ ID NO: 2 or SEQ ID NO:
 3. 12. The synthesis kit of claim 8, further comprising a second cognate tRNA substrate and a second artificial ribozyme comprising the artificial ribozyme of claim 1, wherein the T-box element of the second artificial ribozyme recognizes and preferentially binds the second cognate tRNA substrate.
 13. The synthesis kit of claim 8, wherein the first targeted amino acid comprises an unnatural amino acid.
 14. The synthesis kit of claim 12, further comprising a second targeted amino acid and wherein at least one of a second group consisting of the second artificial ribozyme, the second cognate tRNA substrate, and the second targeted amino acid are housed in a second container separate from the other members of the second group.
 15. A method for synthesizing peptides that incorporate one or more amino acids at a specific site, the method comprising the steps of: providing a tRNA substrate, a first amino acid, and an artificial ribozyme, the artificial ribozyme comprising: a T-box element comprising a 3′-terminus and a specifier loop for binding the tRNA substrate with specificity, and a flexizyme comprising dinitro-flexizyme, Fx3, or a variant of Fx3 and defines at least an active site for binding an amino acid, an acceptor end for base pairing to a terminus of the tRNA substrate, a nucleotide linker, and a P1 stem linked to the 3′-terminus of the T-box element, wherein when the T-box element recognizes and preferentially binds the tRNA substrate, the flexizyme can bind and aminoacylate the tRNA substrate bound by the T-box element, and the active site of the flexizyme is not specific to a targeted amino acid; aminoacylating the tRNA substrate with the first amino acid to produce a first aminoacylated tRNA substrate by: combining the artificial ribozyme with the first amino acid such that the active site of the flexizyme binds with the first amino acid, combining the artificial ribozyme with the tRNA substrate such that the specifier loop of the T-box element binds, with specificity, an anticodon on the tRNA substrate and a 3′-terminus of the tRNA substrate binds the acceptor end of the flexizyme, and reacting the bound tRNA substrate with the bound first amino acid; and delivering the first aminoacylated tRNA substrate to a ribosome.
 16. The method of claim 15, wherein at least the steps of aminoacylating the tRNA substrate and delivering the first aminoacylated tRNA substrate to a ribosome are performed in vitro and the T-box element comprises SEQ ID NO: 3 or a functional equivalent thereof.
 17. The method of claim 16, wherein the step of delivering the first aminoacylated tRNA substrate to a ribosome comprises exposure to one or more reagents for conducting in vitro translation.
 18. The method of claim 15, further comprising the steps of: providing a second amino acid; aminoacylating the tRNA substrate with the second amino acid to produce a second aminoacylated tRNA substrate by: combining the artificial ribozyme with the second amino acid such that the active site of the flexizyme binds with the second amino acid, combining the artificial ribozyme with the tRNA substrate such that the specifier loop of the T-box element binds, with specificity, an anticodon on the tRNA substrate and a 3′-terminus of the tRNA substrate binds the acceptor end of the flexizyme, and reacting the bound tRNA substrate with the bound second amino acid; and delivering the second aminoacylated tRNA substrate to a ribosome.
 19. The method of claim 15, wherein the T-box element of the artificial ribozyme comprises a mutation that is complimentary to an anticodon of the tRNA substrate and comprises a tyrS T-box.
 20. The method of claim 19, wherein the mutation is at the specifier loop of the T-box element, the tRNA substrate comprises an amber suppressor tRNA, and the first amino acid comprises an unnatural amino acid. 